Abstract. Over the years, numerous experiments have accumulated showing
that cooperation is not casual and depends on the payoffs of the game. These findings
suggest that humans have an attitude to cooperation by nature and that the same person may
act more or less cooperatively depending on the particular payoffs. In other words,
people do not act a priori as single agents, but they forecast how the game would be
played if they formed coalitions and then they play according to their best forecast.

In this paper we formalize this idea and we define a new solution concept for 
one-shot normal form games. 

We prove that this cooperative equilibrium exists for all finite games and that it explains
a number of different experimental findings, such as (1) the rate of cooperation in the
Prisoner's dilemma depends on the cost-benefit ratio; (2) the rate of cooperation in the
Traveler's dilemma depends on the bonus/penalty; (3) the rate of cooperation in the
Public Goods game depends on the per-capita marginal return and on the number
of players; (4) the rate of cooperation in the Bertrand competition depends on the
number of players; (5) players tend to be fair in the bargaining problem; (6) players
tend to be fair in the Ultimatum game; (7) players tend to be altruistic in the Dictator
game; (8) offers in the Ultimatum game are larger than offers in the Dictator game.
1. Introduction



Since its foundation by Morgenstern and von Neumann [Mo-vN44], the major challenge
of modern game theory has been to predict which actions a human player would
adopt in a strategic situation. A first prediction was proposed in an earlier paper by
J. von Neumann [vN28] for two-person zero-sum games and then generalized to every
finite game by J. Nash in [Na50a]. Since then, Nash equilibrium has certainly been the
most notable and widely used solution concept in game theory. Nevertheless, over the last
sixty years, it has been realized that it makes poor predictions of human play and, indeed,
a large number of experiments have been conducted on games for which it dramatically
fails to predict human behavior.

There are many reasons behind this failure. On the one hand, when there are multiple
equilibria, it is not clear which one we should expect to be played. A whole
stream of literature, aimed at selecting one equilibrium, arose from this point,
including the definitions of evolutionarily stable strategy [MS-Pr73], perfect equilibrium
[Se75], trembling hand perfect equilibrium [Se75], proper equilibrium [My78], sequential
equilibrium [Kr-Wi82], limit logit equilibrium [MK-Pa95], and, very recently, settled
equilibrium [My-We12].

On the other hand, the criticism of Nash equilibrium is motivated by more serious
problems: there are examples of games with a unique Nash equilibrium which is
not played by human players. Typical examples of such a troublesome situation are the
Prisoner's Dilemma [Fl52], the Traveler's Dilemma [Ba94], and, more generally, every
social dilemma [Ko88]. This point has motivated another stream of literature devoted
to the explanation of such deviations from Nash equilibria. Part of this literature tries
to explain such deviations by assuming that players make mistakes in the computation
of the expected value of a strategy and therefore, assuming that errors are identically
distributed, a player may also play non-optimal strategies with a probability described
by a Weibull distribution. This intuition led to the foundation of the so-called quantal
response equilibrium theory by McKelvey and Palfrey [MK-Pa95]. A variant of this theory,
called quantal level-k theory and proposed by Stahl and P. Wilson in [St-Wi94], was
recently shown to perform better in the prediction of human behavior [Wr-LB10]. In
the same paper, Wright and Leyton-Brown have also shown that quantal level-k theory
predicts human behavior significantly better than all other behavioral models that have
been proposed in the last decade, such as the level-k theory [CG-Cr-Br01] and the cognitive
hierarchy model [Ca-Ho-Ch04]. However, an obvious criticism of quantal level-k theory
is that it is not scale invariant, contradicting one of the axioms of the expected utility
theory of Morgenstern and von Neumann [Mo-vN47]. A perhaps more fundamental criticism
stems from the fact that quantal level-k theory only makes use of some parameters
describing either the incidence of errors that a player can make in computing the expected
utility of a strategy or the fact that humans can perform only a bounded number of
iterations of strategic reasoning. These features first imply that quantal level-k theory
is not predictive, in the sense that one has to conduct experiments to estimate the
parameters; second, they imply that quantal level-k theory intrinsically affirms that
deviation from Nash equilibria can stem only from two causes, computational mistakes
and bounded rationality, that are hard to justify for games with very simple payoffs, like
the Prisoner's Dilemma, or for games where the deviation from Nash equilibrium is
particularly strong, like the Traveler's Dilemma with small bonus-penalty.

Indeed, the general feeling is that the motivation must lie somewhere deeper and
that Nash equilibrium should be replaced by a conceptually different solution concept
that takes into account other features of human behavior and coincides with Nash
equilibrium only in particular cases. The first studies in this direction have been presented
by Renou and Schlag [Re-Sc09] and Halpern and Pass [Ha-Pa12], by Halpern and Rong
[Ha-Ro10], by Halpern and Pass [Ha-Pa11], by Jamroga and Melissen [Ja-Me11], and
by Adam and Ehud Kalai [Ka-Ka13]. Nevertheless, even though these solution concepts
can explain deviations from Nash equilibria in some particular games, all of them
make unreasonable predictions for many games of interest. For instance, the maximum
perfect cooperative equilibrium introduced in [Ha-Ro10] is too rigid and predicts
cooperation for sure in the Prisoner's and Traveler's Dilemmas, contradicting the
experimental data collected in [Ca-Go-Go-Ho99], [Go-Ho01], [Be-Ca-Na05], [Ba-Be-St11],
[HRZ11], [DEJR12], [Fu-Ra-Dr12], [RGN12]. The iterated regret minimization procedure
introduced in [Re-Sc09] and [Ha-Pa12] can explain deviations towards cooperation
in some variants of the Traveler's Dilemma, the Bertrand competition, the Centipede
Game, and other games of interest, but it does not predict deviation towards cooperation
in the Prisoner's Dilemma [HRZ11], [DEJR12], [Fu-Ra-Dr12], [RGN12] and in
the public good game [Le95], it cannot explain altruistic behaviors in the ultimatum
game [Fe-Sc99] and in the dictator game [En11], and it makes unreasonable predictions
for the Traveler's dilemma with punishment (see Example 5.11) and a certain zero-sum
game (see Example 8.3). The solution concept defined using algorithmic rationality
in [Ha-Pa11] can explain deviation towards cooperation in the iterated Prisoner's
and Traveler's dilemmas, but it does not predict deviation towards cooperation
in one-shot versions of the Prisoner's dilemma or in one-shot versions of the Traveler's
dilemma with very small bonus-penalty, contradicting the experimental data reported
in [Go-Ho01], [Be-Ca-Na05], [HRZ11], [DEJR12], [Fu-Ra-Dr12], [RGN12]. The farsighted
pre-equilibrium introduced in [Ja-Me11] is too rigid. For instance, the Prisoner's
dilemma has two farsighted pre-equilibria, which coincide with Rabin's fairness equilibria
[Ra93], where both players either cooperate or defect for sure. This contradicts the
experimental data reported in [HRZ11], [DEJR12], [Fu-Ra-Dr12], [RGN12], which suggest
that humans tend to play a mixed strategy. Finally, the coco value introduced by
Adam and Ehud Kalai in [Ka-Ka13], unifying and developing previous works by Nash
[Na53], Raiffa [Rai53], and E. Kalai-Rosenthal [Ka-Ro78], also appears to be too rigid.
For instance, if two agents played the Prisoner's dilemma according to the coco value,
then they would both cooperate for sure. This prediction contradicts the experimental
data collected in [HRZ11], [DEJR12], [Fu-Ra-Dr12], [RGN12].

In this paper we try to attribute the failure of all these attempts to two basic problems.

The first problem is the use of utility functions in the very definition of a game.
Indeed, the experimental evidence has shown that expected utility theory fails to predict
the behavior of decision makers [Al53], [Ka-Tv00], [St00].

This problem could be theoretically overcome by replacing utility functions with gain
functions and applying Kahneman-Tversky's cumulative prospect theory [Tv-Ka92].
But one can easily convince oneself that in most cases such a replacement could explain
only quantitative deviations.

The second problem is that experiments conducted on the Prisoner's dilemma,
the Traveler's dilemma, the Dictator game, and other games show qualitative deviations
from classical solution concepts. These qualitative deviations suggest that humans are
altruistic and have an attitude to cooperation.

These observations motivate the definition of a new solution concept, able to take into
account altruism and cooperation and using gain functions instead of utility functions.
This paper represents a first endeavour in this direction. Indeed, here we consider
only one-shot normal form games where the players are completely anonymous, that
is, they do not know each other and they are not allowed to exchange information^1.
The aim of this paper is to define a new solution concept for this class of games. This
solution concept will be called cooperative equilibrium. Indeed, we will see that altruism
plays only a marginal role and the main idea behind this new equilibrium notion is the
formalization of the following principle of cooperation:

(C) Players try to forecast how the game would be played if they formed coalitions
and then they play according to their best forecast.

The study of cooperation in games is not a new idea. Economists, biologists,
psychologists, sociologists, and political scientists have been studying cooperation in social
dilemmas for forty years. These classical approaches explain the tendency to cooperate
by dividing people into proself and prosocial types p84], [LWVW86], [KMM86], [KCC86],
[ML88], or by appealing to forms of external control [Ol65], [Ha68], [Da80], or to
long-term strategies in iterated games [Ax84]. But over the years many experiments have
accumulated showing cooperation even in one-shot social dilemmas without external
control [Is-Wa88], [Co-DJ-Fo-Ro96], [Go-Ho01], [Be-Ca-Na05], [DRFN08], [HRZ11],
[DEJR12]. These and other earlier experiments [Ke-Gr72], [BSKM76], [KSK80], [IWT^
have also shown that the rate of cooperation in the same game depends on the particular
payoffs, suggesting that most likely humans cannot be merely divided into proself
and prosocial types, but that they are engaged in some sort of indirect reciprocity [No-Si98],
[No06] and that the same person may behave more or less cooperatively depending on the
payoffs. In other words, humans have an attitude to cooperation by nature.

To the best of our knowledge, this is the first attempt to lift this well known tendency 
to cooperate up to a general principle which is nothing more than a deeper and smarter 
realization of selfishness. 

The idea behind the formalization of the principle of cooperation and the definition of
the cooperative equilibrium can be briefly summarized as follows:

• We assume that players do not act a priori as single players, but they try to
forecast how the game would be played if they formed coalitions.

• Each forecast is represented by a number $v_i(p)$, called the value of the coalition
structure $p$ for player $i$, which is a measure of the expected gain of player $i$
when she plays according to the coalition structure $p$.

• The numbers $v_i(p)$ induce a sort of common beliefs: we consider the induced
game $\mathrm{Ind}(\mathcal{G}, p)$, which differs from the original game $\mathcal{G}$ only in the set of allowed
profiles of mixed strategies: the profiles of mixed strategies allowed in $\mathrm{Ind}(\mathcal{G}, p)$
are the profiles $(\sigma_1, \ldots, \sigma_N)$ such that $u_i(\sigma_1, \ldots, \sigma_N) \geq v_i(p)$ for any player $i$.



^1 We mention that anonymity is not really a necessary assumption: the effect of any sort of contact
among the players would be a different evaluation of the so-called prior probability $\tau$. The point is that
at the moment it is not clear how this prior probability should be re-evaluated.
• The exact cooperative equilibrium is one where player $i$ plays an equilibrium of
the game $\mathrm{Ind}(\mathcal{G}, p)$ induced by a coalition structure which maximizes the value
function $v_i$^2.

• The notion of equilibrium for the induced game $\mathrm{Ind}(\mathcal{G}, p)$ is not defined using
classical Nash equilibrium, but using a prospect theoretical analogue.

In order to apply prospect theory we must replace utility functions by gain functions,
that is, functions whose values represent the monetary outcomes or, more generally, the
quantity of some good which is won or lost by a player. This replacement comes at the
price that we must take into account explicitly new data that were implicitly included in
the utility functions. Indeed, while utility functions were supposed to contain all relevant
information about players' preferences, gain functions contain only the quantity of
some good which is won or lost by the players. These new data include the fairness
functions $f_i$ and the altruism functions $a_{ij}$. An interesting feature of the cooperative
equilibrium is that, in many games of interest, it does not depend on these functions.
This implies that the cooperative equilibrium is a predictive solution concept for many
games of interest. A bit more precisely, in this paper we prove the following statements.

Fact 1.1. The cooperative equilibrium for the Prisoner's dilemma is predictive (i.e., 
it does not depend on fairness functions and altruism functions) and has the following 
property: the predicted rate of cooperation increases as the cost-benefit ratio increases. 

Fact 1.2. The cooperative equilibrium for the Traveler's dilemma is predictive and has 
the following property: the predicted rate of cooperation decreases as the bonus/penalty 
increases. 

Fact 1.3. The cooperative equilibrium for the Bertrand competition is predictive and it
has the following property: the predicted rate of cooperation decreases as the number
of players increases.

Fact 1.4. The cooperative equilibrium fits Kahneman-Knetsch-Thaler's experiment
related to the ultimatum game.

Fact 1.5. The cooperative equilibrium for the public good game is predictive and it has 
the following properties: (1) the predicted rate of cooperation increases as the marginal 
return increases, and (2) the predicted rate of cooperation decreases as the number 
of players increases and then increases again as the number of players gets sufficiently 
large. 

Fact 1.6. The cooperative equilibrium predicts the (50,50) solution in the Bargaining 
problem under natural assumptions on the fairness functions. 

Roughly speaking, the natural assumption is that the two players have the same 
perception of money. We believe that this assumption is natural, since it is predictable 
that a bargain between a very rich person and a very poor person can have a different 
solution. 

Fact 1.7. The cooperative equilibrium explains the experimental data collected for the 
dictator game, via altruism. 

^2 The word exact means that, since players can have bounded rationality or can make mistakes in
the computations, one can also define a quantal cooperative equilibrium, borrowing ideas from quantal
response equilibrium and quantal level-k theory, and say that player $i$ plays with probability
$e^{\lambda v_i(p)} / \sum_{p'} e^{\lambda v_i(p')}$ a quantal response equilibrium or a quantal level-k equilibrium of the game $\mathrm{Ind}(\mathcal{G}, p)$.
This happens just because we define altruism in terms of human behavior in the
dictator game. To treat the dictator game as the quintessence of altruism is certainly
not a new idea [Ha-Kr00], [BEJN11], [DFR11].

Fact 1.8. The cooperative equilibrium explains the experimental data collected for the
ultimatum game, via a combination of cooperation and altruism.

In particular, the observation that offers in the ultimatum game are larger than the
offers in the dictator game is explained in terms of cooperation, which is generated by
the fact that the responder has the power to reject the proposer's offer.

Another case where the cooperative equilibrium is only descriptive is when the mis- 
takes that players can make in the computations have a very strong influence on the 
result. A typical example is the following. 

Fact 1.9. The quantal cooperative equilibrium explains Goeree-Holt's experiment on 
the asymmetric matching pennies. 

The structure of the paper is as follows. In Section 2 we define the so-called games
in explicit form (see Definition 2.3), where the word explicit emphasizes the fact
that we have to take into account explicitly new data (altruism functions and fairness
functions). In Section 3 we describe the idea informally through a simple example that
motivates all the main definitions of the theory. In Section 4 we define the cooperative
equilibrium for games in explicit form under expected utility theory, that is, without
using cumulative prospect theory, and without using the altruism functions (see
Definition 4.14). The reason for this choice is that in most cases cumulative prospect theory
can change predictions only quantitatively and not qualitatively and that, in most cases,
altruism functions do not play any active role. Indeed, we compute the cooperative
equilibrium (under expected utility theory and without using the altruism functions) for the
Prisoner's Dilemma (see Examples 4.5 and 5.2), the Traveler's Dilemma (see Examples 4.6
and 5.1), the Nash bargaining problem (see Examples 4.4 and 5.9), the Bertrand competition
(see Example 5.4), the public goods game (see Example 5.7), the ultimatum game (see
Example ), and a specific game of particular interest, since iterated regret minimization
theory fails to predict human behavior, whereas the cooperative equilibrium does (see
Example 5.11). We compare the predictions of the cooperative equilibrium with
the experimental data and we show that they are always close. In Section 6
we discuss a few examples where the replacement of expected utility theory by cumulative
prospect theory starts playing an active role (see Examples 6.1 and 6.2). Here
starts the ideal second part of the paper, devoted to the definition of the cooperative
equilibrium for games in explicit form, using cumulative prospect theory and taking into
account altruism. Before doing that, we devote a short section, namely Section 7, to
a brief introduction to cumulative prospect theory. The definition of the cooperative
equilibrium under cumulative prospect theory and taking into account altruism takes
Sections 8 and 9: in the former we define a procedure of iterated deletion of strategies
using the altruism functions and we apply it to explain the experimental data collected
for the dictator game (see Example 8.10) and the ultimatum game (see Example 8.11);
in the latter we repeat the construction done in Section 3, this time under cumulative
prospect theory instead of expected utility theory. Theorem 9.6 shows that all finite
games have a cooperative equilibrium. Part of Section 8 may be of intrinsic interest,
since it contains the definition of super-dominated strategies^3 (see Definition 8.1) and
their application to solve a problem left open in [Ha-Pa12] (see Example ). Section
10 states a few important problems that should be addressed in future research.

^3 Joseph Halpern communicated to the author that he and Rafael Pass have independently introduced
super-dominated strategies (under the name minimax dominated strategies) in [Ha-Pa13].

2. Utility functions vs Gain functions: games in explicit form 

As mentioned in the Introduction, a major innovation that we propose is the use 
of gain functions instead of utility functions. In this section we first elaborate on the 
reasons behind this choice and then we investigate the theoretical consequences of such 
a choice. First recall the classical definition of a game in normal form. 

Definition 2.1. A finite game in strategic or normal form is given by the following
data:

• a finite set of players $P = \{1, 2, \ldots, N\}$;

• for each player $i \in P$, a finite set of strategies $S_i$;

• for each player $i \in P$, a preference relation $\preceq_i$ on $S := S_1 \times \ldots \times S_N$.

It is frequently convenient (and very often included in the definition of a game) to
specify the players' preferences by giving real-valued utility functions $u_i : S \to \mathbb{R}$ that
represent them. The definition and the use of utility functions rely on Morgenstern
and von Neumann's expected utility theory [Mo-vN47], where, to avoid problems such
as risk aversion, they assumed that players' utility functions contain all relevant
information about the players' preferences over strategy profiles. In this way, Nash was
then able to formalize Bernoulli's principle that each player attempts to maximize her
expected utility [Be738] given that the other players attempt to do the same. The use of
utility functions can certainly make the theory much easier, but it is problematic, since
it has been observed that humans constantly violate the principles of expected utility
theory. The very first of such examples was found by M. Allais in [Al53] and many
others are known nowadays (see, for instance, [Ka-Tv00] and [St00] for a large set of
examples). For the sake of completeness, we briefly describe one of these experiments
(see [Ka-Tv79], Problems 3 and 4). In this experiment 95 persons were asked to choose
between:

L1. A lottery where there is a probability of 0.80 to win 4000 and 0.20 to win nothing,
L2. A certain gain of 3000.

An expected utility maximizer would choose the lottery L1. However, Kahneman
and Tversky reported that 80 per cent of the individuals chose the certain gain. The
same 95 persons were then asked to choose between:

L1'. A lottery with a 0.20 chance of winning 4000 and 0.80 of winning nothing,
L2'. A lottery with a 0.25 chance of winning 3000 and 0.75 of winning nothing.

This time 65 per cent of the subjects chose the lottery L1', which is also the lottery
maximizing expected utility. These two results contradict the so-called substitution
axiom in expected utility theory and show how people can behave as expected utility
maximizers or not depending on the particular situation they are facing.
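
The arithmetic behind this example can be checked with a short sketch. It assumes, as the text implicitly does, that utility is linear in money, so that "expected utility" is just the expected monetary outcome; the names `expected_value` and `dilute` are hypothetical helpers introduced only for illustration. The sketch also verifies the structural fact that makes the two choices comparable: L1' and L2' are exactly L1 and L2 played with probability 1/4 (and nothing otherwise), which is why choosing L2 in the first problem and L1' in the second conflicts with the substitution axiom.

```python
from fractions import Fraction

# Each lottery is a list of (probability, monetary outcome) pairs.
L1 = [(Fraction(4, 5), 4000), (Fraction(1, 5), 0)]
L2 = [(Fraction(1), 3000)]
L1_PRIME = [(Fraction(1, 5), 4000), (Fraction(4, 5), 0)]
L2_PRIME = [(Fraction(1, 4), 3000), (Fraction(3, 4), 0)]


def expected_value(lottery):
    """Expected monetary outcome (utility assumed linear in money)."""
    return sum(p * x for p, x in lottery)


def dilute(lottery, weight):
    """Play `lottery` with probability `weight` and win nothing otherwise."""
    outcome_probs = {0: 1 - weight}
    for p, x in lottery:
        outcome_probs[x] = outcome_probs.get(x, 0) + weight * p
    return sorted((p, x) for x, p in outcome_probs.items())


# First problem: L1 has the larger expected value, yet most subjects chose L2.
first_problem = (expected_value(L1), expected_value(L2))
# Second problem: L1' has the larger expected value, and most subjects chose it.
second_problem = (expected_value(L1_PRIME), expected_value(L2_PRIME))

# The second pair is the first pair scaled by a common factor of 1/4.
quarter = Fraction(1, 4)
same_structure = (dilute(L1, quarter) == sorted(L1_PRIME)
                  and dilute(L2, quarter) == sorted(L2_PRIME))

print(first_problem, second_problem, same_structure)
```

Exact fractions are used instead of floats so that the comparison of the diluted lotteries is not affected by rounding.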

An even more dramatic observation is that the evidence suggests that decision makers
weight probabilities in a non-linear manner, whereas expected utility theory postulates
that they weight probabilities linearly. Consider, for instance, the following example
from [Ka-Tv79], p. 283. Suppose that one is compelled to play Russian roulette. One
would be willing to pay much more to reduce the number of bullets from one to zero
than from four to three. However, in each case, the reduction in the probability of a bullet
firing is 1/6 and so, under expected utility theory, the decision maker should be willing to
pay the same amount. One possible explanation is that decision makers do not weight
probabilities in a linear manner as postulated by expected utility theory.
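
A minimal numerical sketch of this explanation follows. It assumes the one-parameter probability weighting function of cumulative prospect theory [Tv-Ka92], $w(p) = p^\gamma / (p^\gamma + (1-p)^\gamma)^{1/\gamma}$, with $\gamma = 0.61$, the estimate reported there for gains; applying it to the Russian roulette story is purely illustrative and not part of the original argument, and the function names are hypothetical.

```python
def tk_weight(p, gamma=0.61):
    """One-parameter Tversky-Kahneman probability weighting function.

    gamma = 0.61 is an assumed, illustrative parameter value.
    """
    return p**gamma / (p**gamma + (1 - p) ** gamma) ** (1 / gamma)


def perceived_reduction(bullets_before, bullets_after, chambers=6):
    """Drop in decision weight when the number of bullets is reduced."""
    return (tk_weight(bullets_before / chambers)
            - tk_weight(bullets_after / chambers))


# Under linear weighting both reductions are worth exactly 1/6.
linear_reduction = 1 / 6

# Under non-linear weighting the two reductions are perceived differently.
one_to_zero = perceived_reduction(1, 0)
four_to_three = perceived_reduction(4, 3)

print(linear_reduction, one_to_zero, four_to_three)
```

With these assumed parameters the step from one bullet to zero carries a larger decision weight than the step from four to three, in line with the willingness to pay described above.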

These problems have now been overcome in decision theory thanks to the celebrated
prospect theory [Ka-Tv79] and cumulative prospect theory [Tv-Ka92]. One of the very
basic principles of (cumulative) prospect theory is that decision makers think in terms
of gains and losses rather than in terms of their net assets; in other words, they think
in terms of gain functions rather than in terms of utility functions. This forces us to
replace utility functions by gain functions. This replacement comes at a price: while
utility functions were supposed to contain all relevant information about the players'
preferences, gain functions do not contain such information, which must be taken into
account separately. As we will recall in Section 7, risk aversion is taken into account
by cumulative prospect theory. Among the remaining relevant information there are (at
least) two items deserving particular attention:

Altruism. A player may prefer to give up part of her gain in order to favor
another player.

Perception of gains. Two different players may have different perceptions
of the same amount of gain.

To define formally a game in terms of gain functions, we introduce a unit of measurement
$\mathfrak{g}$ (typically one dollar, one euro, ...) and postulate that to every action profile
$s \in S$ and to every player $i \in P$ is associated a quantity $g_i(s)$ of $\mathfrak{g}$ which is lost or won by
player $i$ when the strategy profile $s$ is played. We assume that the unit of measurement
(e.g., the currency) is common to all players. The losses are expressed by negative integers
and the wins by positive integers, so that $g_i(s) = 2$ will mean, for instance, that, if
the strategy profile $s$ is played, then player $i$ wins two units of the good $\mathfrak{g}$; analogously,
$g_i(s) = -3$ will mean that, if the strategy profile $s$ is played, then player $i$ loses three
units of the good $\mathfrak{g}$.

Using the unit of measurement, we can take into account altruism and perception of 
gains as follows. 

Definition of the altruism functions. We define a notion of altruism operationally,
that is, the altruism functions can be theoretically computed by running a
pre-experiment. Consider a general dictator game as follows. A proposer has an endowment
of $y \in \mathbb{N}$ units of $\mathfrak{g}$ and a responder already has $z \in \mathbb{Z}$ units of $\mathfrak{g}$. Let $k > 0$. The
proposer chooses $x \in \{0, 1, \ldots, y\}$ to transfer to the responder, who gets $\lfloor kx \rfloor$, that is,
the largest integer smaller than or equal to $kx$. In other words, we define the two-player
game $\mathrm{Dict}(k, y, z)$, where the strategy set of the first player is $S_1 = \{0, 1, \ldots, y\}$ and the
strategy set of the second player contains only one strategy, which we call $A$. The gain
functions are

$g_1(x, A) = y - x$ and $g_2(x, A) = z + \lfloor kx \rfloor$.

Definition 2.2. The altruism function $a_{ij}$ is the function $a_{ij} : (0, \infty) \times \mathbb{N} \times \mathbb{Z} \to \mathbb{N}$ such
that $a_{ij}(k, y, z)$ would be the offer of player $i$ to player $j$ if $i$ were the proposer and $j$
were the responder in $\mathrm{Dict}(k, y, z)$.
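
As a small illustration of the gain functions of the general dictator game, the sketch below evaluates $g_1$ and $g_2$ for a given transfer. The function name `dict_gains` and the sample values of $k$, $y$, $z$ are hypothetical; only the two formulas come from the text. The multiplier is passed as an exact fraction so that the floor is not distorted by floating-point rounding.

```python
import math
from fractions import Fraction


def dict_gains(k, y, z, x):
    """Gains (g_1, g_2) in Dict(k, y, z) when the proposer transfers x.

    g_1(x, A) = y - x and g_2(x, A) = z + floor(k * x).
    """
    if not 0 <= x <= y:
        raise ValueError("the transfer x must lie in {0, 1, ..., y}")
    return y - x, z + math.floor(k * x)


# Illustrative values: endowment y = 4, responder starts with z = 0,
# and every transferred unit is multiplied by k = 3/2.
k = Fraction(3, 2)
for x in range(5):
    print(x, dict_gains(k, 4, 0, x))
```

In this reading, the altruism function $a_{ij}(k, y, z)$ is simply the value of $x$ that player $i$ would actually choose in such a game, so it is an empirical input rather than something the code can compute.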
Definition of the fairness functions. To capture perception of money, we assume
that to each player $i \in P$ is associated a function $f_i : \{(x, y) \in \mathbb{R}^2 : x \geq y\} \to [0, \infty)$
whose role is to quantify how much player $i$ dislikes giving up a gain of $x$
to accept a gain of $y$. The following are then natural requirements:

• $f_i$ is continuous,

• if $x > y$, then $f_i(x, y) > 0$,

• if $x = y$, then $f_i(x, y) = 0$,

• for any fixed $x > 0$, the function $f_i(x, \cdot)$ is strictly decreasing and strictly convex
for positive $y$'s and strictly concave for negative $y$'s,

• for any fixed $y$, the function $f_i(\cdot, y)$ is strictly increasing and strictly concave for
positive $x$'s and strictly convex for negative $x$'s.

The last two properties formalize the well-known diminishing sensitivity principle
[Ka-Tv79]: the same difference of gains (resp. losses) is perceived as smaller if the gains
(resp. losses) are higher. Indeed, one possible way to define the functions $f_i$ is to use
Kahneman-Tversky's value function $v$ and set $f_i(x, y) = v(x) - v(y)$. The problem with
this definition is that it does not take into account that different players may have
different perceptions of the same amount of money (think of the perception of 100 dollars
of a very rich person and a very poor person).
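
The choice $f_i(x, y) = v(x) - v(y)$ can be made concrete with the following sketch. It assumes the power-form value function of [Tv-Ka92], $v(x) = x^\alpha$ for $x \geq 0$ and $v(x) = -\lambda(-x)^\beta$ for $x < 0$, with the parameter estimates $\alpha = \beta = 0.88$ and $\lambda = 2.25$ reported there; both the functional form and the parameters are illustrative assumptions, and the function names are hypothetical.

```python
def kt_value(x, alpha=0.88, beta=0.88, lam=2.25):
    """Power-form Kahneman-Tversky value function (assumed parameters)."""
    if x >= 0:
        return x**alpha
    return -lam * (-x) ** beta


def fairness(x, y):
    """f(x, y) = v(x) - v(y): cost of giving up a gain x to accept y <= x."""
    if x < y:
        raise ValueError("f is only defined for x >= y")
    return kt_value(x) - kt_value(y)


# Diminishing sensitivity: the same difference of 10 units is perceived
# as smaller when the amounts involved are larger, for gains and for losses.
small_gains = fairness(20, 10)
large_gains = fairness(1010, 1000)
small_losses = fairness(-10, -20)
large_losses = fairness(-1000, -1010)

print(small_gains, large_gains, small_losses, large_losses)
```

Because the same $v$ is used for every player, this sketch shares the limitation noted above: it cannot distinguish a very rich player from a very poor one.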

Therefore, we are led to study the following object. 

Definition 2.3. A finite game in explicit form $\mathcal{G} = \mathcal{G}(P, S, \mathfrak{g}, g, a, f)$ is given by the
following data:

• a finite set of players $P = \{1, 2, \ldots, N\}$;

• for each player $i \in P$, a finite set of strategies $S_i$;

• a good $\mathfrak{g}$, which plays the role of a unit of measurement;

• for each player $i \in P$, a function $g_i : S_1 \times \ldots \times S_N \to \mathbb{Z}$, called gain function;

• for each pair of players $(i, j)$, $i \neq j$, an altruism function $a_{ij}$;

• for each player $i \in P$, a fairness function $f_i : \{(x, y) \in \mathbb{R}^2 : x \geq y\} \to \mathbb{R}$
verifying the properties above.
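
One possible way to hold the data of Definition 2.3 in code is sketched below. The class name `ExplicitGame`, its field names, and the 0-based player indices are all hypothetical choices made for illustration; the altruism and fairness entries in the example are placeholders that do not claim to satisfy the requirements listed above.

```python
from dataclasses import dataclass
from itertools import product
from typing import Callable, Dict, Hashable, List, Sequence, Tuple

Profile = Tuple[Hashable, ...]


@dataclass
class ExplicitGame:
    """Container for the data (P, S, good, g, a, f) of a game in explicit form."""
    strategies: Sequence[Sequence[Hashable]]   # S_1, ..., S_N (players are 0-based)
    good: str                                  # unit of measurement
    gain: Callable[[int, Profile], int]        # gain(i, s) = g_i(s), an integer
    altruism: Dict[Tuple[int, int], Callable]  # (i, j) -> a_ij(k, y, z)
    fairness: List[Callable[[float, float], float]]  # f_i(x, y), defined for x >= y

    def players(self) -> range:
        return range(len(self.strategies))

    def profiles(self) -> List[Profile]:
        """All pure strategy profiles S_1 x ... x S_N."""
        return list(product(*self.strategies))


# Example: the dictator game Dict(1, 2, 0) written as a game in explicit form.
game = ExplicitGame(
    strategies=[[0, 1, 2], ["A"]],
    good="dollar",
    gain=lambda i, s: 2 - s[0] if i == 0 else s[0],
    altruism={(0, 1): lambda k, y, z: 0, (1, 0): lambda k, y, z: 0},  # placeholder
    fairness=[lambda x, y: x - y, lambda x, y: x - y],                # placeholder
)

for s in game.profiles():
    print(s, [game.gain(i, s) for i in game.players()])
```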

The terminology explicit highlights the fact that we must take into account
explicitly all parameters that are usually considered implicit in the definition of utility
functions. We are not saying that there are only three such parameters (altruism
functions, fairness functions, and risk aversion) and this is indeed the first of a long series of
points of the theory deserving more attention in future research. In particular, there
is some evidence that badness parameters can play an important role in some games.
We shall elaborate on this in Section 10.

The purpose of the paper is to define a solution concept for games in explicit form 
taking into account altruism and cooperation and using cumulative prospect theory 
instead of expected utility theory. Nevertheless, we will see that 

• in most cases the use of cumulative prospect theory instead of expected utility 
theory can change predictions only quantitatively and not qualitatively; 

• in most cases the altruism functions do not play any active role, since there are
no players having a strategy which gives a certain disadvantage to other players.

Consequently, we prefer to introduce the cooperative equilibrium in two steps. In the
first one we keep expected utility theory and we do not use the altruism functions. The
aim of the first step is only to formalize the principle of cooperation. We show that
this cooperative equilibrium under expected utility theory and without altruism
can already explain experimental data satisfactorily. In Section 6 we discuss some examples
where the cooperative equilibrium under expected utility theory does not perform well
because of the use of expected utility theory and because we did not take into account
altruism, and so we move towards the definition of the cooperative equilibrium under
cumulative prospect theory and taking into account altruism.



3. An informal sketch of the definition

In this section we describe the cooperative equilibrium (under expected utility theory
and without taking into account altruism) starting from an example. The idea is indeed
very simple, even though the complete formalization requires a number of preliminary
definitions that will be given in the next section.

Consider the following variant of the Traveler's dilemma. Two players have the same
strategy set $S_1 = S_2 = \{180, 181, \ldots, 300\}$. The gain functions are



The usual backward induction implies that (180, 180) is the unique Nash equilib- 
rium. Nevertheless, numerous experimental studies reject this prediction and show that 
humans play significantly larger strategies. 
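
The claim about the equilibrium can be checked by brute force under an explicit assumption on the payoffs. The sketch below assumes the standard Traveler's dilemma rule with a bonus and a penalty both equal to 5: the player with the lower claim receives her claim plus 5, the player with the higher claim receives the lower claim minus 5, and equal claims are paid as stated. The value 5 is an assumption chosen to agree with the gain of 304 that the discussion of the cooperative coalition attributes to playing 299 against 300. The search covers pure strategies only; function names are hypothetical.

```python
BONUS = 5  # assumed bonus/penalty, see the paragraph above
STRATEGIES = range(180, 301)


def gain(own, other, bonus=BONUS):
    """Gain of a player claiming `own` against a claim `other` (assumed rule)."""
    if own < other:
        return own + bonus
    if own > other:
        return other - bonus
    return own


def best_responses(other):
    """Claims that maximize the gain against a fixed claim of the opponent."""
    best = max(gain(s, other) for s in STRATEGIES)
    return {s for s in STRATEGIES if gain(s, other) == best}


def pure_nash_equilibria():
    """Profiles in which each claim is a best response to the other."""
    br = {other: best_responses(other) for other in STRATEGIES}
    return [(a, b) for a in STRATEGIES for b in STRATEGIES
            if a in br[b] and b in br[a]]


# Deviations from 300 that are at least as good as staying at 300
# when the opponent is believed to claim 300.
weakly_profitable = [s for s in STRATEGIES
                     if s < 300 and gain(s, 300) >= gain(300, 300)]

print(pure_nash_equilibria(), weakly_profitable, gain(299, 300))
```

Undercutting the opponent by one unit is always the best response above 180, which is why the only mutual best response sits at the bottom of the strategy set.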

In the cooperative equilibrium, we formalize the idea that players forecast how the 
game would be played if they formed coalitions and then they play according to their 
best forecast. 

Let us try to describe how this idea will be formalized. In a two-player game, such as the
Traveler's dilemma, there are only two possible coalition structures, the selfish coalition
structure $p_s = (\{1\}, \{2\})$ and the cooperative coalition structure $p_c = (\{1, 2\})$. Let us
analyze them:

• If agents play according to the selfish coalition structure, then by definition they
do not have any incentive to cooperate and therefore they would play the Nash
equilibrium (180, 180). A Nash equilibrium is, by definition, stable, in the sense
that no player has any incentive to change strategy. Consequently, both
players would get 180 for sure. In this case we say that the value of the selfish
coalition structure is 180 and we write $v(p_s) = 180$.

• Now, let us analyze the cooperative coalition structure pc- The largest gain for 
each of the two agents, if they play together, is to get 300, that is attained 
by the profile of strategies (300,300). Nevertheless, each player knows that 
the other player may defect and play a smaller strategy and so the value of 
the cooperative coalition is not 300, but we have to take into account possible 
deviations. Let us look at the problem from the point of view of player 1. The 
other player, player 2, may deviate and play the strategy 299 or the strategy 
298, or the strategy 297, or the strategy 296, or the strategy 295 (indeed, all 
these strategies give at least the same gain as the strategy 300, if the first player 
is believed to play the strategy 300). In this case, the best that player 2 can 
obtain is 304 (if she plays 299 and the first player plays 300) and so we say 
that the incentive to deviate from the coalition is 304 — 300 = 4. We denote 
this number by 1)2 (pc)- Now, if player 2 decides to deviate from the coalition. 



3. An informal sketch of the definition 




and 
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she or he incurs in a risk due to the fact that also player 1 can deviate from 
the coalition either to follow selfish interest or because player 1 is clever enough 
to understand that player 2 can deviate from the coalition and then player 1 
decides to anticipate this move. The maximal risk that player 2 incurs trying 
to achieve her maximal gain is then attained when player 2 deviates to 299 and 
player 1 anticipates this deviation and play 298. In this case, player 2 would 
gain (^2(298,299) = 293. So we say that the risk in deviating from the coalition 
structure Pc is i?2(Pc) = 300 — 293 = 7. We now interpret the number 

$$
\tau_{1,\{2\}}(p_c) = \frac{D_2(p_c)}{D_2(p_c) + R_2(p_c)} = \frac{4}{11}
$$

as a sort of prior probability that player 1 assigns to the event "player 2 abandons
the coalition structure $p_c$". Consequently, we also obtain a number

$$
\tau_{1,\emptyset}(p_c) = 1 - \tau_{1,\{2\}}(p_c),
$$

which is interpreted as a prior probability that player 1 assigns to the event
"nobody abandons the coalition structure $p_c$".

This probability measure will now be used to weight the numbers $e_{1,\emptyset}(p_c)$,
representing the infimum of gains that player 1 receives if nobody abandons the
coalition, and $e_{1,\{2\}}(p_c)$, representing the infimum of gains that player 1 receives
if the second player abandons the coalition. In our example, one has

$$
e_{1,\emptyset}(p_c) = 300 \qquad \text{and} \qquad e_{1,\{2\}}(p_c) = 290,
$$

where the second number comes from the fact that the worst that can happen
to player 1, if the second player abandons the coalition and the first player
does not, corresponds to the profile of strategies (300, 295), which gives a
gain of 290 to the first player. Taking the average, we obtain the value of the
cooperative coalition for player 1:

$$
v_1(p_c) = 300 \cdot \frac{7}{11} + 290 \cdot \frac{4}{11} \approx 296.36.
$$
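
The arithmetic of this informal sketch can be checked with a few lines of Python. This is only an illustrative sketch, not the formal definition of the next section: the function names are hypothetical, and the risk is computed under the assumption, taken from the informal description above, that player 1 anticipates player 2's best deviation with a best response to it.

```python
from fractions import Fraction

BONUS = 5                      # bonus-penalty of this variant
STRATEGIES = range(180, 301)   # S_1 = S_2 = {180, ..., 300}


def gain(own, other, b=BONUS):
    """Gain of a player claiming `own` when the opponent claims `other`."""
    if own < other:
        return own + b
    if own > other:
        return other - b
    return own


def cooperative_value(top=300, b=BONUS, strategies=STRATEGIES):
    """Return (D_2, R_2, tau_{1,{2}}, e_{1,{2}}, v_1) for the coalition {1, 2}."""
    base = gain(top, top, b)
    # Deviations of player 2 from (top, top): claims at least as good as `top`.
    deviations = [s for s in strategies if s != top and gain(s, top, b) >= base]
    # Incentive: the largest gain player 2 can obtain by deviating.
    best_dev = max(deviations, key=lambda s: gain(s, top, b))
    incentive = gain(best_dev, top, b) - base
    # Risk (assumption of this sketch): player 1 best-responds to `best_dev`.
    reply = max(strategies, key=lambda s: gain(s, best_dev, b))
    risk = base - gain(best_dev, reply, b)
    tau_defect = Fraction(incentive, incentive + risk)
    # Worst gain of player 1 if player 2 deviates while player 1 stays at `top`.
    worst = min(gain(top, s, b) for s in deviations)
    value = base * (1 - tau_defect) + worst * tau_defect
    return incentive, risk, tau_defect, worst, value


print(cooperative_value())
```

With exact fractions the value is 3260/11, that is, approximately 296.36.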

By symmetry one has $v_2(p_s) = v_1(p_s) =: v(p_s) = 180$ and $v_2(p_c) = v_1(p_c) =: v(p_c) \approx
296.36$. So one has $v(p_s) < v(p_c)$, and the cooperative equilibrium predicts that
the agents play according to the cooperative coalition structure, since it gives a better
forecast. The meaning of the phrase "play according to $p_c$" has to be clarified. Indeed, since
the profile (300, 300) is not stable, we cannot expect that the players play the
strategy 300 for sure. What we do is interpret the values $v_i(p_c)$ as a sort of common beliefs:
players simply keep only the profiles of strategies $\sigma = (\sigma_1, \sigma_2)$ such that $g_1(\sigma) \ge v_1(p_c)$
and $g_2(\sigma) \ge v_2(p_c)$. Computing the Nash equilibrium of this induced game gives
the cooperative equilibrium of the game, which, in this case, is a mixed strategy
supported between 296 and 297. Observe that this is very close to the experimental
data. Indeed, the one-shot version of this game was tested experimentally by Goeree and Holt,
who reported that 80 per cent of subjects played a strategy between 290 and 300, with
an average of 295 (see [Go-Ho01]).

The purpose of the next section is to formalize the idea that we have just described.
Indeed, even though the idea is very simple and in many relevant cases computations
can easily be performed by hand (cf. Section 5), the correct formalization requires the
whole of Section 4 because of the following technical problems:

• In the particular example that we have just described, the cooperative coalition
structure leads to a one-player game with a unique Nash equilibrium, which is
(300, 300). In general this will not happen, and we should take into account that
one Nash equilibrium can be less fair than another. For instance, the cooperative
coalition structure in the Nash bargaining problem leads to a one-player game with
many Nash equilibria, but intuitively only the (50, 50) solution is fair.

• The definition of deviation and risk is intuitively very simple, but the general
mathematical formalization is not straightforward.

4. The cooperative equilibrium under expected utility theory 

Let $\mathcal{G} = \mathcal{G}(P, S, 0, g, a, f)$ be a finite* game in explicit form. As usual, to make
notation lighter, we denote by $S_{-i}$ the Cartesian product of all the $S_j$'s but $S_i$. Let
$\mathcal{P}(X)$ be the set of probability measures on the finite set $X$. If
$\sigma = (\sigma_1, \ldots, \sigma_N) \in \mathcal{P}(S_1) \times \ldots \times \mathcal{P}(S_N)$,
we denote by $\sigma_{-i}$ the $(N-1)$-dimensional vector of measures
$(\sigma_1, \ldots, \sigma_{i-1}, \sigma_{i+1}, \ldots, \sigma_N)$
and, as usual in expected utility theory, we set

$$
g_j(\sigma_i, \sigma_{-i}) = g_j(\sigma) := \sum_{(s_1, \ldots, s_N) \in S} g_j(s_1, \ldots, s_N)\, \sigma_1(s_1) \cdot \ldots \cdot \sigma_N(s_N).
$$

Conversely, if $\sigma_i \in \mathcal{P}(S_i)$ for all $i \in P$, the notation $g_j(\sigma_i, \sigma_{-i})$ simply stands for the
number $g_j(\sigma_1, \ldots, \sigma_N)$.

The main idea behind our definition is the principle of cooperation, that is, players 
try to forecast how the game would be played if they formed coalitions and then they 
play according to their best forecast. Borrowing a well-known terminology from the
literature on coalition formation (cf. [Ra08]), we give the following definition.

Definition 4.1. A coalition structure is a partition $p = (p_1, \ldots, p_k)$ of the player set
$P$; that is, the $p_\alpha$'s are subsets of $P$ such that $p_\alpha \cap p_\beta = \emptyset$ for all $\alpha \neq \beta$, and $\bigcup_\alpha p_\alpha = P$.
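
As an illustration of Definition 4.1, the coalition structures of a small player set can be enumerated recursively. The sketch below is a generic set-partition enumeration; the function name `coalition_structures` is hypothetical and not part of the paper.

```python
def coalition_structures(players):
    """Yield every partition of `players` as a list of coalitions (lists)."""
    players = list(players)
    if not players:
        yield []
        return
    first, rest = players[0], players[1:]
    for partition in coalition_structures(rest):
        # Put `first` into each existing coalition in turn...
        for k in range(len(partition)):
            yield partition[:k] + [[first] + partition[k]] + partition[k + 1:]
        # ...or let `first` form a coalition alone.
        yield [[first]] + partition


for p in coalition_structures([1, 2]):
    print(p)
```

For two players this yields exactly the cooperative structure and the selfish structure. The number of coalition structures grows quickly with the number of players (2, 5, 15, ... for 2, 3, 4, ... players).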

As mentioned in the Introduction, the idea is that each player $i \in P$ assigns a value
to each coalition structure $p$ and then plays according to the coalition structure with
highest value. As described in Section 3, the idea to define the value of a coalition
structure $p$ for player $i$ is to take an average of the following kind. Suppose that for all
$J \subseteq P \setminus \{i\}$ we have defined a number $\tau_{i,J}(p)$ describing the probability that players
in $J$ abandon the coalition structure $p$ and a number $e_{i,J}(p)$ describing the infimum of
possible gains of player $i$ when players in $J$ abandon the coalition structure $p$. Then we
(would) define

$$
v_i(p) = \sum_{J \subseteq P \setminus \{i\}} e_{i,J}(p)\, \tau_{i,J}(p).
$$

Our aim is to give a reasonable definition for the numbers $e_{i,J}(p)$ and $\tau_{i,J}(p)$ under
the assumption that players do not know each other and are not allowed to exchange
information. Of course, this is a real restriction of the theory: if the players
know each other and/or are allowed to exchange information, this will be reflected in the
computation of the probability $\tau_{i,J}(p)$.

* It is well known that the study of infinite games can be very subtle. For instance, there is large
consensus that, at least when the strategy sets do not have a natural structure of a standard Borel
space, one must allow also purely finitely additive probability measures as mixed strategies, leading
to the problem that even the mixed extension of the utility functions is not uniquely defined
[Ma97], [St05], [Ca-Mo12], [Ca-Sc12]. At this first stage of the research we want to avoid all these technical
issues and we focus our attention only on finite games.



A SOLUTION CONCEPT FOR GAMES WITH ALTRUISM AND COOPERATION 13 

Before defining the numbers $e_{i,J}(p)$ and $\tau_{i,J}(p)$, we need to understand what kind
of strategies agree with the coalition structure $p$. Indeed, as mentioned in Section 3,
if $p \neq (\{1\}, \ldots, \{N\})$ is not the selfish coalition structure, some profiles of strategies
might not be acceptable to the players in the same coalition because they do not share
the gain in a fair way among the players belonging to the same coalition $p_\alpha$. We can
define a notion of fairness making use of the fairness functions $f_i$. First observe that the
hypothesis of working with gain functions expressed using the same unit of measurement
for all players allows us to sum the gains of different players and, consequently, we can
say that a coalition structure $p = (p_1, \ldots, p_k)$ generates a game with $k$ players as follows.
The players are the sets $p_\alpha$ in the partition, the pure strategy set of $p_\alpha$ is $\prod_{i \in p_\alpha} S_i$, and
the gain function of player $p_\alpha$ is

$$
g_{p_\alpha}(s_1, \ldots, s_N) = \sum_{i \in p_\alpha} g_i(s_1, \ldots, s_N). \tag{2}
$$

This game, which we denote by $\mathcal{G}_p$, has a non-empty set of Nash equilibria*, which we
denote by $\mathrm{Nash}(\mathcal{G}_p)$. Since the players in the same $p_\alpha$ are ideally cooperating, not all
Nash equilibria are acceptable, but only the ones that distribute the gain of the coalition
$p_\alpha$ as fairly as possible among the players belonging to $p_\alpha$.

To define the subset of fair, or acceptable, equilibria, fix $i \in P$ and consider the
restricted function $\tilde{g}_i = g_i|_{\mathrm{Nash}(\mathcal{G}_p)} : \mathrm{Nash}(\mathcal{G}_p) \to \mathbb{R}$. Since $\mathrm{Nash}(\mathcal{G}_p)$ is compact and $\tilde{g}_i$
is continuous, we can find $w_i \in \mathrm{Nash}(\mathcal{G}_p)$ maximizing $\tilde{g}_i$.

Definition 4.2. The disagreement in playing the profile of strategies $\sigma \in \mathrm{Nash}(\mathcal{G}_p)$ for
the coalition $p_\alpha$ is the number

$$
\mathrm{Dis}_{p_\alpha}(\sigma) = \sum_{i \in p_\alpha} f_i\big(g_i(w_i), g_i(\sigma)\big).
$$

Recalling that the number $f_i(x, y)$ represents how much player $i$ dislikes
renouncing a gain of $x$ and accepting a gain of $y < x$, we obtain that, in order to have
a fair distribution of the gain among the players in the coalition $p_\alpha$, the disagreement
$\mathrm{Dis}_{p_\alpha}$ must be minimized.

Definition 4.3. The Nash equilibrium $\sigma \in \mathrm{Nash}(\mathcal{G}_p)$ is acceptable or fair for the
coalition $p_\alpha$ if $\sigma$ minimizes $\mathrm{Dis}_{p_\alpha}(\sigma)$.

Since the set of Nash equilibria of a finite game is compact and since the functions $f_i$
are continuous, it follows that the set of acceptable equilibria is non-empty and compact.

Let us say explicitly that this is the only point where we use the functions $f_i$. It
follows that, for a game $\mathcal{G}$ such that every game $\mathcal{G}_p$ has a unique Nash equilibrium, the
cooperative equilibrium does not depend on the functions $f_i$.

The importance of the hypotheses about strict convexity in the second variable and
strict concavity in the first variable of the functions $f_i$ should now be clear; it is
in any case illustrated in the first of the following series of examples.

* If $p = (P)$ is the grand coalition, then $\mathcal{G}_p$ is a one-player game, whose Nash equilibria are all probability
measures supported on the set of strategies maximizing the gain function.

Example 4.4. Consider a finite version of Nash's bargaining problem [Na50b], where
two persons have the same strategy set $S_1 = S_2 = S = \{0, 1, \ldots, 100\}$ and the gain
functions are as follows:

$$
g_1(x,y) = \begin{cases} x, & \text{if } x + y \le 100, \\ 0, & \text{if } x + y > 100, \end{cases}
\qquad \text{and} \qquad
g_2(x,y) = \begin{cases} y, & \text{if } x + y \le 100, \\ 0, & \text{if } x + y > 100. \end{cases}
$$

As is well known, this game has attracted attention from game theorists since, despite
having many pure Nash equilibria, only one is intuitively natural. Indeed, many papers have
been devoted to selecting this natural equilibrium by adding axioms (see [Na50b], [Ka-Sm75],
and [Ka77]) or by using different solution concepts (see [Ha-Ro10] and [Ha-Pa12]).

Assume that the two players have the same perception of money, that is, $f_1 = f_2 =: f$.
Consider the cooperative coalition $p_c = (\{1,2\})$ describing cooperation between the
two players. The game $\mathcal{G}_{p_c}$ is a one-player game whose Nash equilibria are all pairs
$(x, 100 - x)$, $x \in S_1$, and all probability measures on $S_1 \times S_2$ supported on such pairs of
strategies. Despite having all these Nash equilibria, the unique acceptable equilibrium
for the coalition is (50, 50). Indeed, one has

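$$
\begin{aligned}
\mathrm{Dis}_{p_c}(50, 50) &= f(100, 50) + f(100, 50) \\
&= f\left(100, \tfrac{1}{2} \cdot 100 + \tfrac{1}{2} \cdot 0\right) + f\left(100, \tfrac{1}{2} \cdot 100 + \tfrac{1}{2} \cdot 0\right) \\
&< \tfrac{1}{2} f(100, 100) + \tfrac{1}{2} f(100, 0) + \tfrac{1}{2} f(100, 100) + \tfrac{1}{2} f(100, 0) \\
&= f(100, 100) + f(100, 0) \\
&= \mathrm{Dis}_{p_c}(100, 0).
\end{aligned}
$$

Analogously, one gets $\mathrm{Dis}_{p_c}(50, 50) < \mathrm{Dis}_{p_c}((x, 100 - x))$ for all $x \in \{0, 1, \ldots, 100\}$,
$x \neq 50$. Consequently, (50, 50) is the unique acceptable equilibrium for the cooperative
coalition $p_c$.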

Now let $p_s = (\{1\}, \{2\})$ be the selfish coalition structure. Then the unique acceptable
equilibrium for player 1 is (100, 0) and the unique acceptable equilibrium for player
2 is (0, 100).
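
The minimization in Example 4.4 can be checked numerically for a concrete choice of $f$. In the sketch below the function `f(x, y) = (x - y)**2` is purely hypothetical: it is chosen only because it is strictly convex in the second variable, which is the property used in the inequality above, and it is not claimed to satisfy every hypothesis that the theory places on fairness functions.

```python
def f(x, y):
    """Hypothetical fairness function: cost of accepting y instead of x."""
    return (x - y) ** 2


def disagreement(x, total=100):
    """Dis_{p_c} at the equilibrium (x, total - x).

    Each player's best equilibrium outcome in the one-player game is `total`,
    so the two terms are f(total, x) and f(total, total - x).
    """
    return f(total, x) + f(total, total - x)


fairest = min(range(101), key=disagreement)
print(fairest, disagreement(fairest), disagreement(100))
```

Under this assumption the disagreement is minimized only at the equal split, in agreement with the example.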

Example 4.5. As a second example, we consider the Prisoner's Dilemma. As is well known,
this famous game was originally introduced by Flood in [Fl52], where he reported on a
series of experiments, one of which, now known as the Prisoner's Dilemma, was conducted
in 1950. Even though Flood's report is seriously questionable, as also observed by Nash
himself (cf. [Fl52], pp. 24-25), it probably represents the first evidence that humans
tend to cooperate in the Prisoner's Dilemma. This evidence has been confirmed in
[Co-DJ-Fo-Ro96], where the authors observed a non-negligible percentage of cooperation
even in the one-shot version of the Prisoner's dilemma.

Here we consider a parametrized version of the Prisoner's Dilemma, as follows. Two
persons have the same strategy set $S_1 = S_2 = \{C, D\}$, where C stands for cooperate and
D stands for defect. Let $\mu > 0$ and denote by $\mathcal{G}^{(\mu)}$ the game described by the following
gains:

          C                        D

C   $1 + \mu$, $1 + \mu$        $0$, $2 + \mu$

D   $2 + \mu$, $0$              $1$, $1$
Therefore, the parameter $\mu$ plays the role of a reward for cooperating. The intuition,
motivated by similar experiments conducted on the Traveler's Dilemma (cf. Example
4.6) or on the repeated Prisoner's dilemma [DRFN08], suggests that humans should
play the selfish strategy D for very small values of $\mu$ and tend to cooperate for very
large values of $\mu$. This intuition is in fact so natural that Fudenberg, Rand, and Dreber,
motivated by experimental results on the repeated Prisoner's dilemma, asked "How do
the strategies used vary with the gains to cooperation?" (cf. [Fu-Ra-Dr12], p. 727,
Question 4). We will propose an answer to this question (for the one-shot Prisoner's dilemma)
in Example 5.2, where we will show that the cooperative equilibrium predicts a rate of
cooperation depending on the particular gains and that such an equilibrium is computable
by a very simple formula (cf. Proposition 5.3). For now, let us just compute the
acceptable Nash equilibria for the two partitions of $P = \{1,2\}$. Let $p_c = (\{1,2\})$ be the
cooperative coalition structure, describing cooperation between the players. In this case
we obtain a one-player game with gains:



$$
g_{p_c}(C,C) = 2 + 2\mu, \qquad g_{p_c}(C,D) = 2 + \mu, \qquad g_{p_c}(D,C) = 2 + \mu, \qquad g_{p_c}(D,D) = 2,
$$

whose unique Nash equilibrium (i.e., the profile of strategies maximizing the payoff)
is the cooperative profile of strategies (C, C). Uniqueness implies that this equilibrium
must be acceptable independently of the $f_i$'s. On the other hand, the selfish coalition
structure $p_s = (\{1\},\{2\})$ generates the original game, whose unique equilibrium is,
as is well known, the defecting profile of strategies (D, D). Also in this case, uniqueness
implies that this equilibrium must be acceptable.
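
The two computations of Example 4.5 can be reproduced mechanically. The sketch below builds the coalition gain as the sum of the members' gains and finds the pure Nash equilibria of the original game by brute force; the function names and the test value of $\mu$ are illustrative choices, not part of the paper.

```python
from fractions import Fraction
from itertools import product


def pd_gains(mu):
    """Gains (g_1, g_2) of the parametrized Prisoner's Dilemma per pure profile."""
    return {
        ("C", "C"): (1 + mu, 1 + mu),
        ("C", "D"): (0, 2 + mu),
        ("D", "C"): (2 + mu, 0),
        ("D", "D"): (1, 1),
    }


def coalition_gains(mu):
    """Gain of the grand coalition {1, 2}: the sum of the two players' gains."""
    return {profile: sum(g) for profile, g in pd_gains(mu).items()}


def pure_nash(mu):
    """Pure Nash equilibria of the original game (selfish coalition structure)."""
    g = pd_gains(mu)
    equilibria = []
    for a, b in product("CD", repeat=2):
        best1 = all(g[(a, b)][0] >= g[(x, b)][0] for x in "CD")
        best2 = all(g[(a, b)][1] >= g[(a, y)][1] for y in "CD")
        if best1 and best2:
            equilibria.append((a, b))
    return equilibria


mu = Fraction(1, 2)  # arbitrary positive value, chosen only for illustration
joint = coalition_gains(mu)
print(max(joint, key=joint.get), pure_nash(mu))
```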

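Example 4.6. Finally, we consider the Traveler's Dilemma. This game was introduced
by Basu in [Ba94] with the purpose of constructing a game where Nash equilibrium makes
unreasonable predictions. Basu's intuition was indeed confirmed by experiments on
both one-shot and repeated treatments [Ca-Go-Go-Ho99], [Go-Ho01], [Be-Ca-Na05],
[Ba-Be-St11]. Fix a parameter $b \in \{2, 3, \ldots, 180\}$; two players have the same strategy
set $S_1 = S_2 = \{180, 181, \ldots, 300\}$ and payoffs:

$$
g_1(x,y) = \begin{cases} x + b, & \text{if } x < y, \\ x, & \text{if } x = y, \\ y - b, & \text{if } x > y, \end{cases}
\qquad \text{and} \qquad
g_2(x,y) = \begin{cases} y + b, & \text{if } y < x, \\ y, & \text{if } y = x, \\ x - b, & \text{if } y > x. \end{cases}
$$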
This game has a unique Nash equilibrium, which is (180, 180). Nevertheless, it has been
observed that humans tend to cooperate (i.e., play strategies close to (300, 300)) for
small values of $b$ and tend to be selfish (i.e., play strategies close to the Nash equilibrium
(180, 180)) for large values of $b$. This is indeed what the cooperative equilibrium predicts,
as we will see in Example 5.1. For now, let us just compute the sets of acceptable
equilibria for all partitions of $P = \{1, 2\}$. Let $p_c = (\{1, 2\})$ be the cooperative coalition
structure, describing cooperation between the players. In this case we obtain a one- 
player game whose unique Nash equilibrium is attained by the cooperative profile of 
strategies (300,300). Uniqueness implies that this equilibrium must be acceptable. On 
the other hand, the selfish coalition structure ps = ({1},{2}) gives rise to the unique 
Nash equilibrium of the game, which is (180, 180). Also in this case, uniqueness implies 
that this equilibrium must be acceptable. 

Coming back to the description of the theory, we have obtained, for all partitions $p$
of the player set $P$ and for all sets $p_\alpha$ of the partition, a (compact) set of acceptable
equilibria $\mathrm{Acc}_{p_\alpha}(\mathcal{G}_p)$ for the coalition $p_\alpha$ inside the coalition structure $p$. Now we define
the numbers $e_{i,J}(p)$ and $\tau_{i,J}(p)$.

Definition of the numbers $\tau_{i,J}(p)$. We recall that the number $\tau_{i,J}(p)$ represents
the probability that player $i$ assigns to the event "players in $J$ abandon the coalition
structure $p$". Consequently, it is enough to define the numbers $\tau_{i,J}(p)$ when $J = \{j\}$
contains only one element. The other numbers can indeed be reconstructed assuming
that the events "player $j$ deviates from $p$" and "player $k$ deviates from $p$" are
independent. This assumption is natural in this context, where players are not allowed to
exchange information.

Therefore, fix $j \in P$, with $j \neq i$. The definition of $\tau_{i,j}(p)$ is intuitively very simple.
It will be a ratio

$$
\tau_{i,j}(p) = \frac{D_j(p)}{D_j(p) + R_j(p)},
$$

where:

• the number $D_j(p)$ represents the incentive for player $j$ to abandon the coalition
structure $p$, that is, the maximal gain that player $j$ can get by leaving the coalition;

• the number $R_j(p)$ represents the risk that player $j$ takes by leaving the coalition
structure $p$, that is, the maximal loss that player $j$ can incur trying to achieve
her maximal gain, assuming that other players can also abandon the coalition,
either to follow selfish interests or to anticipate player $j$'s defection.

To make this intuition formal, first define

$$
M(p_\alpha, p) := \{\sigma \in \mathrm{Acc}_{p_\alpha}(\mathcal{G}_p) : g_{p_\alpha}(\sigma) \text{ is maximal}\}. \tag{3}
$$

The idea is indeed that players in the same coalition try to achieve their maximal
joint gain but, in doing so, there might be some conflicts among coalitions. Therefore, we
are interested in the strategy profiles that can be constructed by putting together
pieces of strategies in the various $M(p_\alpha, p)$.

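To this end, let us fix a piece of notation. For a given player $j$, let
$\pi_j : \mathcal{P}(S_1) \times \ldots \times \mathcal{P}(S_N) \to \mathcal{P}(S_j)$ be the canonical projection. We may reconstruct an element
$\sigma \in \mathcal{P}(S_1) \times \ldots \times \mathcal{P}(S_N)$ through its projections and we write formally $\sigma = \bigotimes_{j=1}^{N} \pi_j(\sigma)$.
Set

$$
\widetilde{M}(p_\alpha, p) := \left\{ \bigotimes_{i \in p_\alpha} \pi_i(\sigma) : \sigma \in M(p_\alpha, p) \right\}, \tag{4}
$$

and then

$$
M(p) = \bigotimes_{\alpha=1}^{k} \widetilde{M}(p_\alpha, p). \tag{5}
$$

In words, $M(p)$ is the set of strategy profiles that can be constructed by putting together
pieces of acceptable equilibria maximizing the joint gain of each coalition.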

Remark 4.7. It is worth mentioning that in many relevant cases all sets $M(p_\alpha, p)$
contain only one element and the computations become very simple and unambiguous. However,
in some cases, as in the route choice game, this set may contain multiple, and theoretically
even infinitely many, elements. From a mathematical point of view, this is not a problem,
since we need only compactness of the sets $M(p_\alpha, p)$ and these sets are indeed compact.
However, in some cases there might be a natural way to restrict the sets $M(p_\alpha, p)$,
leading to a computationally lighter and intuitively more natural definition. For instance, in
games with particular symmetries, such as the basic route choice game*, players are typically
indifferent among all pure Nash equilibria maximizing their gains and, therefore, it is
natural to restrict the set $M(p_\alpha, p)$ and take only its barycenter, which is, in this case,
the uniform measure**. Theoretically, this construction may be extended to every game,
since $M(p_\alpha, p)$ is always compact and so it has a barycenter (see [Ha-Va89], Sec. 3.b).
But we do not think that the assumption that players in the same $p_\alpha$ are indifferent
among all the acceptable strategies which maximize their joint gain is very general, and
it would probably not make sense in very asymmetric games. How to restrict the sets
$M(p_\alpha, p)$ is another point of the theory that deserves particular attention in future
research.

* There are 2N players, each of whom has to decide the route to go to work between two equivalent
routes.

Definition 4.8. Let $\sigma \in \mathcal{P}(S_1) \times \ldots \times \mathcal{P}(S_N)$ be a profile of mixed strategies and
$\sigma'_k \in \mathcal{P}(S_k)$. We say that $\sigma'_k$ is a $k$-deviation from $\sigma$ if $g_k(\sigma'_k, \sigma_{-k}) \ge g_k(\sigma)$.

Now we can finally move towards the definition of incentive and risk. We recall that
we have fixed a coalition structure $p$ and two players $i, j \in P$, with $i \neq j$, and we want to
define the incentive and risk for player $j$ to abandon the coalition structure. Let $\mathrm{Dev}_j(p)$
denote the set of strategies of player $j$ that are $j$-deviations from at least one strategy in
$M(p)$.

Definition 4.9. The incentive for player $j$ to deviate from the coalition structure $p$ is

$$
D_j(p) := \max \{ g_j(\sigma'_j, \sigma_{-j}) - g_j(\sigma) : (\sigma, \sigma'_j) \in \mathrm{Dev}_j(p) \}. \tag{6}
$$

Observe that $D_j(p)$ is attained since the set $\mathrm{Dev}_j(p)$ is compact.

If $D_j(p) = 0$, then $j$ does not gain anything by leaving the coalition and therefore $j$
does not have any incentive to abandon the coalition structure $p$. In this case, we
simply define $\tau_{i,j}(p) = 0$.

Consider now the more interesting case $D_j(p) > 0$, where player $j$ has an actual
incentive to deviate from the coalition structure $p$. If $j$ decides to leave $p$, it may happen
that she loses part of her gain if other players decide to abandon $p$, either to follow selfish
interests or to answer player $j$'s defection. To quantify this risk, we first introduce some
notation. Let $(\sigma, \sigma'_j) \in \mathrm{Dev}_j(p)$ be such that $D_j(p)$ is attained. Call $T(\sigma, \sigma'_j)$ the set of
$\sigma'_{-j} \in \bigotimes_{i \neq j} \mathcal{P}(S_i)$ such that

• there is $k \in P \setminus \{j\}$ such that $\pi_k(\sigma'_{-j})$ is a $k$-deviation from either $\sigma$ or $(\sigma'_j, \sigma_{-j})$.

Thus we quantify the risk by

$$
R_j(p) := \sup \{ g_j(\sigma) - g_j(\sigma'_j, \sigma'_{-j}) \}, \tag{7}
$$

where the supremum is taken over all

(A) $(\sigma, \sigma'_j) \in \mathrm{Dev}_j(p)$ such that $D_j(p)$ is attained,

(B) $\sigma'_{-j} \in T(\sigma, \sigma'_j)$.

The requirement (A) is motivated by the fact that if player $j$ believes that she can leave
the coalition structure $p$ to follow selfish interests, then she must take into account
that other players may also deviate from $p$, either to follow selfish interests or because
they are clever enough to anticipate player $j$'s defection. This can obstruct player $j$'s
deviation, if another player's deviation causes a loss to player $j$.

Definition 4.10. The prior probability that player $j$ deviates from the coalition
structure $p$ is

$$
\tau_{i,j}(p) = \frac{D_j(p)}{D_j(p) + R_j(p)}.
$$



** It was reported in [RKDG09] that players tend to play uniformly in the basic route choice game.

The term "prior" is meant to clarify the fact that the event "player $j$ abandons
the coalition" is not measurable in any absolute and meaningful sense. The prior
probability is a sort of a priori measure of this event, knowing only mathematically
measurable information, such as monetary incentive and monetary risk.

Remark 4.11. If the set $T(\sigma, \sigma'_j)$ is empty for all $(\sigma, \sigma'_j) \in \mathrm{Dev}_j(p)$, then the supremum
defining the risk $R_j(p)$ is equal to zero. Consequently, the prior probability that player
$j$ abandons the coalition structure $p$ is equal to 1. This is coherent with the intuition
that if $T(\sigma, \sigma'_j) = \emptyset$, then there is no way to obstruct player $j$'s defection.

As said before, we can now compute all remaining probabilities $\tau_{i,J}(p)$ assuming that
the events "player $j$ deviates from $p$" and "player $k$ deviates from $p$" are independent.
In particular, $\tau_{i,\emptyset}(p)$ will represent the probability that none of the players other than $i$
deviates from the coalition structure.

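Definition of the numbers $e_{i,J}(p)$. We recall that the numbers $e_{i,J}(p)$ represent the
infimum of gains of player $i$ when the players in $J$ decide to deviate from the coalition
structure $p$. Therefore, the definition of these numbers is very straightforward. Let
$J \subseteq P \setminus \{i\}$; we first define the set

$$
\mathrm{Dev}_J(p) := \left\{ (\sigma, \sigma'_J) \in \bigotimes_{\alpha=1}^{k} \widetilde{M}(p_\alpha, p) \times \bigotimes_{j \in J} \mathcal{P}(S_j) : \exists j \in J : g_j(\pi_j(\sigma'_J), \sigma_{-j}) \ge g_j(\sigma) \right\}.
$$

Then we define

$$
e_{i,J}(p) := \inf \{ g_i(\sigma'_J, \sigma_{-J}) : (\sigma, \sigma'_J) \in \mathrm{Dev}_J(p) \}.
$$

Definition 4.12. The value of the coalition structure $p$ for player $i$ is

$$
v_i(p) = \sum_{J \subseteq P \setminus \{i\}} e_{i,J}(p)\, \tau_{i,J}(p). \tag{8}
$$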
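
The way the single-player prior probabilities, the independence assumption and formula (8) fit together can be sketched as follows. The function names are hypothetical, and reading $\tau_{i,J}(p)$ as the probability that exactly the players in $J$ deviate is an interpretation made for this sketch; it is consistent with $\tau_{i,\emptyset}(p)$ being the probability that nobody deviates.

```python
from fractions import Fraction
from itertools import combinations


def prior_probability(incentive, risk):
    """tau = D / (D + R), with tau = 0 when there is no incentive to deviate."""
    if incentive == 0:
        return Fraction(0)
    return Fraction(incentive) / (Fraction(incentive) + Fraction(risk))


def subset_probabilities(tau):
    """Probability that exactly the players in J deviate, for every subset J.

    `tau` maps each player j (other than i) to tau_{i,j}; deviations of
    different players are treated as independent events.
    """
    others = sorted(tau)
    result = {}
    for size in range(len(others) + 1):
        for subset in combinations(others, size):
            prob = Fraction(1)
            for j in others:
                prob *= tau[j] if j in subset else 1 - tau[j]
            result[frozenset(subset)] = prob
    return result


def coalition_value(infima, tau):
    """v_i(p) as the sum over J of e_{i,J}(p) * tau_{i,J}(p)."""
    probs = subset_probabilities(tau)
    return sum(infima[J] * probs[J] for J in probs)


# Numbers from the Traveler's dilemma sketch of Section 3 (player i = 1).
tau = {2: prior_probability(4, 7)}
infima = {frozenset(): 300, frozenset({2}): 290}
print(coalition_value(infima, tau))   # 3260/11
```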

We stress that at this first stage of the research we cannot say that this formula is
ultimately the right way to compute the value of a coalition structure. It just seems
a fairly natural way and, as we will show in Section 5, it fits experimental data
satisfactorily well. However, it is likely that future research, possibly supported by
suitable experiments, will suggest the use of a different formula. For instance, we will
describe in Example ?? that it is possible that the deviation $D_j(p)$ should be computed
taking into account not only deviations to achieve higher gains, but also deviations to get a safe
gain.

Now, in an exact theory, player i is assumed to have unbounded rationality and 
is assumed not to make mistakes in the computations and so, using the principle of 
cooperation, she will play according to some p which maximises the value function Vi. 
It remains to understand the meaning of playing according to a coalition structure $p$.
Indeed, we cannot expect that player i will play surely according to an acceptable Nash 
equilibrium of Qp, since she knows that other players may deviate from the coalition. 
What we can do is to use the numbers Vi{p) to define a sort of beliefs. 

Definition 4.13. Let $A \subseteq \mathcal{P}(S_1) \times \ldots \times \mathcal{P}(S_N)$. The subgame induced by $A$ is the
game whose set of mixed strategies of player $i$ is the closed convex hull in $\mathcal{P}(S_i)$ of the
projection set $\pi_i(A)$.

Therefore, a subgame induced by a set $A$ is not, strictly speaking, a game, since
in general the set of mixed strategies of player $i$ cannot be described as the convex
hull of a set of pure strategies which is a subset of $S_i$. In the induced game only
particular mixed strategies are allowed, which, as said earlier, correspond to some sort
of beliefs. Observe that, since the set of allowed mixed strategies is convex and compact,
we can formally find a Nash equilibrium of an induced game. Indeed, Nash's proof of
existence of equilibria does not really use the fact that the utility functions are defined
on $\mathcal{P}(S_1) \times \ldots \times \mathcal{P}(S_N)$, but only that they are defined on a convex and compact subset
of $\mathcal{P}(S_1) \times \ldots \times \mathcal{P}(S_N)$.

Let $\mathrm{Ind}(\mathcal{G}, p)$ be the subgame induced by the set of strategies $\sigma \in \mathcal{P}(S_1) \times \ldots \times \mathcal{P}(S_N)$
such that $g_i(\sigma) \ge v_i(p)$ for all $i \in P$. Observe that the induced game is not empty,
since $v_i(p)$ is a convex combination of infima of values attained by the gain function $g_i$.

Definition 4.14. (Exact cooperative equilibrium) An exact cooperative equilibrium
is one where player $i$ plays a Nash equilibrium of the subgame $\mathrm{Ind}(\mathcal{G}, p)$, where $p$
maximizes $v_i(p)$*.

One could define a quantal cooperative equilibrium, declaring that player $i$ plays
with probability $e^{\lambda v_i(p)} / \sum_{p'} e^{\lambda v_i(p')}$ according to the quantal response equilibrium or the
quantal level-k theory applied to $\mathrm{Ind}(\mathcal{G}, p)$. At this first stage of the research, we are
not interested in such refinements, which could be useful in future and deeper analysis
(cf. Examples ?? and ??).
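
The quantal weighting mentioned above can be illustrated with a standard logit (softmax) rule over the values of the coalition structures. This is only a sketch under stated assumptions: the precision parameter `lam`, the function name and the labels of the coalition structures are hypothetical, and the two values are those of the Traveler's dilemma variant of Section 3.

```python
import math


def quantal_weights(values, lam):
    """Logit weights exp(lam * v) / sum(exp(lam * v')) over coalition structures."""
    top = max(values.values())
    # Subtracting the maximum leaves the ratios unchanged and avoids overflow.
    scores = {p: math.exp(lam * (v - top)) for p, v in values.items()}
    total = sum(scores.values())
    return {p: s / total for p, s in scores.items()}


values = {"cooperative": 3260 / 11, "selfish": 180.0}
weights = quantal_weights(values, lam=0.01)
print(weights)
```

With `lam = 0` both coalition structures receive the same weight, while for large `lam` the weight concentrates on the structure with the highest value, which recovers the exact cooperative equilibrium as a limiting case.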



5. Examples and experimental evidence 

In this section we apply the cooperative equilibrium (under expected utility theory 
and without using altruism) to some well known games. The results we obtain are en- 
couraging, since the predictions of the cooperative equilibrium are always satisfactorily 
close to the experimental data. We present also two examples where the coopera- 
tive equilibrium makes new predictions, completely different from all standard theories. 
These new predictions are partially supported by experimental data, but we do not have 
enough precise data to say that they are strongly confirmed. 

Example 5.1. Let $\mathcal{G}^{(b)}$ be the parametrized Traveler's Dilemma in Example 4.6, with
bonus-penalty equal to $b$. Let $p_c = (\{1, 2\})$ be the cooperative coalition. We recall that
in Example 4.6 we have shown that the profile of strategies (300, 300) is the unique
acceptable equilibrium for $p_c$. To compute the values of $p_c$, let $i = 1$ (the case $i = 2$ is
the same, by symmetry). One has $D_2(p_c) = b - 1$, corresponding to the strategy profile
(300, 299). Corresponding to this deviation of player 2, which is the unique deviation
maximizing player 2's gain, the best deviation for player 1 is to play the strategy 298,
which gives $g_2(300, 300) - g_2(298, 299) = 2 + b$. Therefore, $R_2(p_c) = 2 + b$. Consequently,
we have

$$
\tau_{1,\{2\}}(p_c) = \frac{b - 1}{2b + 1} \qquad \text{and} \qquad \tau_{1,\emptyset}(p_c) = \frac{b + 2}{2b + 1}.
$$

Now, $e_{1,\{2\}}(p_c) = 300 - 2b$, corresponding to the profile of strategies $(300, 300 - b)$, and
$e_{1,\emptyset}(p_c) = 300$. Consequently, setting $v_1(p_c) = v_2(p_c) =: v(p_c)$, we have

$$
v(p_c) = 300 \cdot \frac{b + 2}{2b + 1} + (300 - 2b) \cdot \frac{b - 1}{2b + 1}.
$$



* Observe that this is well defined also in case of multiple $p$'s maximizing $v_i(p)$, since the induced games
$\mathrm{Ind}(\mathcal{G}, p)$ and $\mathrm{Ind}(\mathcal{G}, p')$ are the same if $p, p'$ are both maximizers.

On the other hand, the selfish coalition structure $p_s = (\{1\}, \{2\})$ has value

$$
v_1(p_s) = v_2(p_s) = 180,
$$

since there are no possible deviations from a Nash equilibrium. Therefore, for small
values of $b$, one has $v(p_c) > v(p_s)$ and the cooperative equilibrium predicts that agents
play according to the cooperative coalition; for large values of $b$, one has $v(p_s) > v(p_c)$
and then the cooperative equilibrium predicts that agents play the Nash equilibrium.
Moreover, the rate of cooperation depends on $b$: the larger $b$ is, the smaller the
predicted rate of cooperation. We are aware of only two experimental studies devoted
to the one-shot Traveler's dilemma. In these cases, the predictions are even quantitatively
close.

• For $b = 2$ and $S_1 = S_2 = \{2, 3, \ldots, 100\}$, it has been reported in [Be-Ca-Na05]
that most subjects (38 out of 45) chose a number between 90 and 100 and
the strategy which had the highest payoff was $s = 97$. In our case, we obtain

\[ v(p_c) = 100 \cdot \frac{b+2}{2b+1} + 96 \cdot \frac{b-1}{2b+1} = 99.2. \]

Consequently, the cooperative equilibrium is supported near 99.

• For $b = 5$ and $S_1 = S_2 = \{180, 181, \ldots, 300\}$, it has been reported in [Go-Ho01]
that about 80 per cent of the subjects submitted a strategy between 290 and
300, with an average of 295. In our case, we obtain

\[ v(p_c) = 300 \cdot \frac{b+2}{2b+1} + 290 \cdot \frac{b-1}{2b+1} \approx 296.36. \]

Consequently, the cooperative equilibrium is supported between 296 and 297,
which is very close to the experimental data.

• For $b = 180$ and $S_1 = S_2 = \{180, 181, \ldots, 300\}$, it was reported in [Go-Ho01]
that about 80 per cent of the subjects played the Nash equilibrium 180. In our
case, one easily sees that

\[ v(p_c) < v(p_s). \]

Consequently, the cooperative equilibrium reduces to the Nash equilibrium and
predicts the solution (180, 180). So the cooperative equilibrium coincides with what
most subjects played.
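The three comparisons above can be reproduced with a few lines of Python. This is only an illustrative sketch of the formula for $v(p_c)$: the function name `cooperative_value` is ours, and the value of the selfish structure is assumed to be the lowest admissible claim (2 or 180), which is the Nash payoff.

```python
from fractions import Fraction

def cooperative_value(max_claim, b):
    """Value v(p_c) of the cooperative coalition in the Traveler's dilemma.

    max_claim is the largest admissible claim, b the bonus-penalty.
    """
    b = Fraction(b)
    stay = (b + 2) / (2 * b + 1)    # weight of the payoff when nobody deviates
    leave = (b - 1) / (2 * b + 1)   # weight of the payoff when the opponent deviates
    return max_claim * stay + (max_claim - 2 * b) * leave

# (largest claim, assumed selfish value = lowest claim, bonus-penalty)
for max_claim, v_s, b in [(100, 2, 2), (300, 180, 5), (300, 180, 180)]:
    v_c = cooperative_value(max_claim, b)
    regime = "cooperative coalition" if v_c > v_s else "Nash equilibrium"
    print(f"b={b}: v(p_c)={float(v_c):.2f}, v(p_s)={v_s} -> {regime}")
```

Exact rational arithmetic is used so that the comparison between $v(p_c)$ and $v(p_s)$ is not affected by rounding.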



Example 5.2. We consider the parametrized Prisoner's dilemma introduced in an earlier example.
Observe that all known solution concepts predict either defection for sure or cooperation
for sure. Nevertheless, the data collected on the conceptually similar parametrized
Traveler's dilemma suggest that human behavior in the parametrized Prisoner's dilemma
should depend on the parameter. This intuition is partially supported by the results
presented in [DRFN08], where the authors reported on experiments conducted on the
repeated Prisoner's dilemma with punishment and observed that subjects tend to
cooperate more when the cost of cooperating is smaller. Motivated by these experimental
data, Fudenberg, Rand, and Dreber indeed asked "How do the strategies used vary with
the gains to cooperation?" (cf. [Fu-Ra-Dr12], p. 727, Question 4).

We now show that in fact the cooperative equilibrium predicts a rate of cooperation which
depends on the particular gains.

Proposition 5.3. The unique cooperative equilibrium of the parametrized Prisoner's
dilemma $\mathcal{G}^{(\mu)}$ is:

• $(D, D)$, if $\mu < 1$,

• $\frac{\mu - 1}{\mu} C + \frac{1}{\mu} D$ for both players, if $\mu \ge 1$.

In particular, the cooperative equilibrium of $\mathcal{G}^{(\mu)}$ has the following appealing properties:

(1) It predicts defection for $\mu = 0$,

(2) It moves continuously and monotonically from defection to cooperation, as $\mu$
increases,

(3) It converges to cooperation as $\mu \to \infty$.

Proof. The cooperative coalition structure $p_c = (\{1,2\})$ gives rise to a one-player game
whose unique Nash equilibrium is the cooperative profile $(C, C)$. The value of this
coalition is, for both players,

\[ v_1(p_c) = v_2(p_c) = (1 + \mu)\left(1 - \frac{1}{1+\mu}\right) = \mu. \]

The selfish partition $p_s = (\{1\}, \{2\})$ gives rise to the classical Nash equilibrium $(D, D)$.
The value of $p_s$ is then, for both players,

\[ v_1(p_s) = v_2(p_s) = 1. \]

Therefore, for $\mu < 1$ one has $v_1(p_c) = v_2(p_c) < v_1(p_s) = v_2(p_s)$ and therefore the
cooperative equilibrium predicts defection. To compute the cooperative equilibrium for
$\mu \ge 1$, first we need to find all profiles of strategies $(\sigma_1, \sigma_2)$ such that

\[ g_i(\sigma_1, \sigma_2) \ge \mu. \]

To this end, set $\sigma_1 = \lambda_1 a_1 + (1 - \lambda_1) b_1$ and $\sigma_2 = \lambda_2 a_2 + (1 - \lambda_2) b_2$. From the inequality above
one gets

\[ \begin{cases} \lambda_1\lambda_2(1+\mu) + (1-\lambda_1)\lambda_2(2+\mu) + (1-\lambda_1)(1-\lambda_2) \ge \mu \\ \lambda_1\lambda_2(1+\mu) + (1-\lambda_2)\lambda_1(2+\mu) + (1-\lambda_1)(1-\lambda_2) \ge \mu \end{cases} \]

Computing the Nash equilibrium restricted to the induced game defined by these
strategies is very easy. Indeed, it is clear, by symmetry, that this Nash equilibrium must be
symmetric and so it is enough to find the lowest $\lambda$ such that $(\lambda, \lambda)$ is a solution of the system above.
For $\lambda_1 = \lambda_2 = \lambda$ the left-hand side reduces to $\mu\lambda + 1$, so one easily finds $\lambda = \frac{\mu - 1}{\mu}$, as claimed. □
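The last step of the proof can be checked numerically. The sketch below assumes the symmetric case $\lambda_1 = \lambda_2 = \lambda$ of the system in the proof, whose left-hand side simplifies to $\mu\lambda + 1$; the function names are ours and purely illustrative.

```python
from fractions import Fraction

def symmetric_gain(lam, mu):
    """Expected gain of a player when both cooperate with probability lam."""
    return lam * lam * (1 + mu) + (1 - lam) * lam * (2 + mu) + (1 - lam) ** 2

def cooperation_probability(mu):
    """Smallest symmetric cooperation probability whose gain is at least mu."""
    mu = Fraction(mu)
    if mu < 1:
        # v(p_c) = mu < 1 = v(p_s): the selfish structure prevails, pure defection
        return Fraction(0)
    return (mu - 1) / mu  # solves mu * lam + 1 = mu

for mu in [Fraction(1, 2), 1, 2, 4, 100]:
    lam = cooperation_probability(mu)
    print(f"mu={mu}: cooperate with probability {lam}")
```

The printed probabilities start at 0, increase with $\mu$ and approach 1, in line with properties (1) to (3) of the proposition.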

As a specific example of a one-shot Prisoner's dilemma, we consider the one recently
tested using MTurk in [DEJR12], with monetary outcomes (expressed in dollars)
$T = 0.20$, $R = 0.15$, $P = 0.05$, $S = 0$. Fix $i = 1$. Denote by $p_s$ the selfish coalition
structure, where the players are supposed to act separately. Then $\mathcal{G}_{p_s} = \mathcal{G}$, whose unique
Nash equilibrium is $(D, D)$. Since a Nash equilibrium has no deviations, $D_2(p_s) = 0$
and consequently $v(p_s) = 0.05$. Now, let $p_c$ be the cooperative coalition structure, where
the players are supposed to play together. The game $\mathcal{G}_{p_c}$ is a one-player game whose only
Nash equilibrium is $(C, C)$. Now, $D_2(p_c) = 0.05$, since the second player can get 0.20
instead of 0.15 if she defects and the first player cooperates, and $R_2(p_c) = 0.10$, since
the second player risks getting 0.05 instead of 0.15 if the other player also defects. Finally,
$e_{1,\emptyset}(p_c) = 0.15$ and $e_{1,\{2\}}(p_c) = 0$. Consequently, $v(p_c) = 0.10$, which is larger than $v(p_s)$.
So we need to compute the Nash equilibrium of $\mathrm{Ind}(\mathcal{G}, p_c)$. By symmetry of the game,
this is the same as finding the smallest $\lambda$ such that $0.15\lambda^2 + 0.2\lambda(1 - \lambda) + 0.05(1 - \lambda)^2 \ge 0.1$,
that is, $\lambda = \frac{1}{2}$. Consequently, the cooperative equilibrium of this variant of the
Prisoner's dilemma is $\frac{1}{2} C + \frac{1}{2} D$ for both players. Notice that in [DEJR12] it has been
reported that players cooperated with probability 58 per cent in one treatment and 65
per cent in another treatment, and the over-cooperation in the second experiment was
explained in terms of a framing effect due to the different ways in which the same game
was presented.
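The computation for this specific game can be sketched as follows. The code assumes the two-player pattern used in the text (incentive $T - R$, risk $R - P$, value equal to the no-deviation payoff and the deviation payoff weighted by the prior probability of defection) and finds the smallest symmetric $\lambda$ by a plain grid search; `pd_cooperative_mix` and the grid size are our own illustrative choices.

```python
from fractions import Fraction

def pd_cooperative_mix(T, R, P, S, steps=1000):
    """Return (v(p_c), v(p_s), smallest symmetric cooperation probability)."""
    T, R, P, S = (Fraction(str(x)) for x in (T, R, P, S))
    incentive = T - R                  # D_2(p_c): gain from defecting on a cooperator
    risk = R - P                       # R_2(p_c): loss if the other player defects too
    tau = incentive / (incentive + risk)
    v_c = R * (1 - tau) + S * tau      # value of the cooperative structure
    v_s = P                            # value of the selfish structure (Nash payoff)
    if v_c <= v_s:
        return v_c, v_s, Fraction(0)   # the selfish structure prevails
    for k in range(steps + 1):         # grid search, smallest lambda first
        lam = Fraction(k, steps)
        gain = R * lam**2 + (T + S) * lam * (1 - lam) + P * (1 - lam)**2
        if gain >= v_c:
            return v_c, v_s, lam
    return v_c, v_s, Fraction(1)

v_c, v_s, lam = pd_cooperative_mix(0.20, 0.15, 0.05, 0)
print(f"v(p_c)={float(v_c)}, v(p_s)={float(v_s)}, lambda={lam}")
```

Payoffs are converted to exact fractions so that the threshold $\lambda$ on the grid is not missed because of floating-point rounding.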

Example 5.4. Let us consider the Bertrand competition. Each of N players simultane- 
ously chooses an integer between 2 and 100. The player who chooses the lowest number 
gets a dollar amount times the number she bids and the rest of the players get 0. Ties 
are split among all players who submit the corresponding bid. 

The unique Nash equilibrium of this game is to choose 2. Nevertheless, it has been
reported in [Du-Gn00] that humans tend to choose larger numbers. It was also observed
that the claims tend to get closer to the Nash equilibrium when the number of players
gets larger.

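To compute the value of the cooperative coalition $p_c = (\{1, \ldots, N\})$ we observe that
every player $j$ has incentive $D_j(p_c) = 49$ and risk $R_j(p_c) = 50$. We then obtain

• For $N = 2$, $v_1(p_c) = v_2(p_c) = 50 \cdot \frac{50}{99}$,

• For $N = 4$, one has

\[ v_1(p_c) = \ldots = v_4(p_c) = 50 \cdot \left(1 - 3 \cdot \frac{49}{99} + 3 \cdot \left(\frac{49}{99}\right)^2 - \left(\frac{49}{99}\right)^3\right), \]

• and so forth.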

In other words, using the law of total probability, one can easily show that the value
of the cooperative coalition converges to 0 very quickly. Consequently, when $N$
increases, the value decreases and the cooperative equilibrium predicts smaller and smaller
claims. This matches qualitatively what was reported for a repeated Bertrand competition in
[Du-Gn00].

Example 5.5. In this example we show that the cooperative equilibrium theory fits
an experiment reported by Kahneman, Knetsch and Thaler in [KKT86]. Consider the
ultimatum game. A proposer and a responder bargain about the distribution of a surplus
of fixed size that we suppose normalized to ten. The responder's share is denoted by
$s$ and the proposer's share by $10 - s$. The bargaining rules stipulate that the proposer
offers a share $s \in [0, 10]$ to the responder. The responder can accept or reject $s$. In
case of acceptance the proposer receives a monetary payoff $10 - s$, while the responder
receives $s$. In case of a rejection both players receive a monetary return of zero.

Kahneman, Knetsch and Thaler conducted the following experiment: 115 subjects,
divided in three classes, were asked to say what would be the minimum offer (between
0 and 10 Canadian dollars) that they would accept, if they were responders. The mean
answers were 2.00, 2.24 and 2.59 (see [KKT86], Table 2).

Now, cooperative equilibrium theory predicts that the responder would accept any 
offer larger than the value of the coalition structure with the largest value. So let us 
compute the value for the responder of the two coalition structures ps and pc assuming 
that the two players have the same perception of money. 

Denote by $A$ and $R$ the responder's actions accept and reject, respectively. As in Nash's
bargaining problem, we obtain that the cooperative coalition $p_c = (\{1,2\})$ leads to a
one-player game $\mathcal{G}_{p_c}$ with the unique acceptable equilibrium $(5, A)$. Therefore, we have

\[ v_2(p_c) = 2.5, \]

since the first player can abandon the coalition by playing any $s < 5$, but she risks
losing everything if the second player rejects the offer (observe that $R$ is a 2-deviation to
the strategy $s = 0$). On the other hand, of course, one has $v(p_s) = 0$, corresponding to
the equilibrium $(0, R)$.

Consequently, cooperative equilibrium theory predicts that the responder would
accept any offer larger than 2.5 dollars, which fits the experimental data reported in
[KKT86].

In a very recent and not yet published experiment, Wells and Rand [We-Ra]
reported that the average claim of 44 subjects was 10.7 out of 30 monetary units. This
corresponds to 35.6 per cent, which is apparently considerably larger than what cooperative
equilibrium predicts. However, averaging the (normalized) results in
[KKT86] and [We-Ra] (44 subjects claimed an average of 0.356, 43 subjects claimed an
average of 0.259, 37 subjects claimed an average of 0.224, and 35 subjects claimed an
average of 0.200), one finds an average claim of 0.264, which is in fact very close to the
prediction of the cooperative equilibrium, which is 0.25.
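The pooled average quoted above is a subject-weighted mean of the four group means. A minimal sketch of that arithmetic, using only the group sizes and means stated in the text (the helper name `pooled_mean` is ours):

```python
# (number of subjects, mean normalized claim) for each group quoted in the text
groups = [(44, 0.356), (43, 0.259), (37, 0.224), (35, 0.200)]

def pooled_mean(groups):
    """Mean claim over all subjects, weighting each group by its size."""
    total_subjects = sum(n for n, _ in groups)
    return sum(n * mean for n, mean in groups) / total_subjects

print(f"pooled mean claim: {pooled_mean(groups):.4f}")
```

The result is roughly 0.26, to be compared with the predicted 0.25.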

Remark 5.6. The cooperative equilibrium also predicts well other experimental
data collected for the ultimatum game.

Recall that the unique subgame perfect equilibrium of the ultimatum game is to
offer $s = 0$. Nevertheless, there are numerous experimental studies which reject this
prediction and show that proposers almost always make substantially larger offers. Fehr
and Schmidt [Fe-Sc99] explained these observations making use of two parameters $\alpha_i, \beta_i$
for each player. Let us find out what happens using cooperative equilibrium.

Concerning the selfish coalition $p_s = (\{1\}, \{2\})$, one easily sees that

\[ v_1(p_s) = v_2(p_s) = 0, \]

in correspondence to the subgame perfect equilibrium $(0, R)$. Concerning the
cooperative coalition, we have

\[ v_1(p_c) = \frac{1}{2}, \]

since the second player has no incentive to abandon the coalition, and

\[ v_2(p_c) = \frac{1}{4}, \]

as shown in Example 5.5. Consequently, the exact cooperative equilibrium predicts
that the proposer offers $s = 0.25$ and the responder accepts. This explains the fact that
there are virtually no offers below 0.2 and above 0.5, which was observed in [Fe-Sc99] by
making a comparison among experimental data collected in [GSS82], [KKT86], [FHSS88],
[RPOZ91], [Ca95], [HMcS96], and [Sl-Ro97].

So there are some data that can be explained by the cooperative equilibrium under
expected utility theory and without altruism. Other data can be explained using
altruism. For instance, it was observed that the proposer's offer was very often higher than
0.25 and, in most of the cases, it was between 0.4 and 0.5 (cf. [Fe-Sc99], Table I).
This stronger deviation towards cooperation is not predicted by the exact cooperative
equilibrium without altruism and we will show in Example 8.11 how the cooperative
equilibrium with altruism can explain it.

We now discuss an example that we believe is relevant because it makes predictions 
that are significantly different from Nash equilibrium. Such predictions are partially 
confirmed by experimental data, but it would be important to conduct more precise
experiments in order to see how humans behave in such a situation. 

Example 5.7. Let us consider the $N$-player public good game. There are $N$ players,
each of whom has to decide on her contribution level $x_i \in [0, y]$ to the public good. The
monetary payoff of player $i$ is given by

\[ g_i(x_1, x_2, \ldots, x_N) = y - x_i + a(x_1 + x_2 + \ldots + x_N), \]

where $\frac{1}{N} < a < 1$ denotes the constant marginal return to the public good $X =
x_1 + x_2 + \ldots + x_N$. Notice that the unique perfect equilibrium is to choose $x_i = 0$.
Nevertheless, this free ride hypothesis has been rejected by numerous experimental studies
(see, e.g., [Ma-Am81], [Is-Wa88], [IWW94], [Le95]). In particular, [Is-Wa88] and [IWW94]
explicitly reported the intuitive fact that, for a fixed number of players,
claims get larger as $a$ gets larger and the much less intuitive fact that, for a fixed $a$,
claims get larger when the number of players is large enough. We now show that the
first property is predicted by the cooperative equilibrium and we anticipate that the
second property is predicted by the cooperative equilibrium under cumulative prospect
theory.

Proposition 5.8. Let $N$, the number of players, be fixed. Denote by $v(p_c)$ and $v(p_s)$,
respectively, the value of the cooperative coalition structure $p_c = (\{1, \ldots, N\})$ and of the
selfish coalition structure $p_s = (\{1\}, \ldots, \{N\})$. Then the function $v(p_c) - v(p_s)$ has the
following properties:

(1) it is strictly increasing in the variable $a$,

(2) it is negative for $a = \frac{1}{N}$,

(3) it is positive for $a = 1$.

The proof of this proposition is a long and tedious computation. Here we report 
explicitly only the proof for N = 2. How to treat the general case should then be clear 
(use the law of total probabilities). 

Proof of Proposition 5.8 with $N = 2$. Let $p_c = (\{1,2\})$ be the cooperative coalition
structure. The unique Nash equilibrium of the game $\mathcal{G}_{p_c}$ is $(y, y)$ and each of the
two players gets $e_{1,\emptyset}(p_c) = 2ay$. Assume $i = 1$ (the case $i = 2$ is symmetric). Observe
that $D_2(p_c) = y + ay - 2ay = y - ay$. Indeed, the best deviation for player 2 is to play
$x_2 = 0$, which gives a payoff of $y + ay$, if $x_1 = y$. The risk is $R_2(p_c) = 2ay - y$. Indeed,
if player 1 also abandons the coalition $p_c$ to play the selfish strategy $x_1 = 0$, player 2
would get $y$ instead of $2ay$. Consequently

\[ \tau_{1,\{2\}}(p_c) = \frac{y - ay}{y - ay + 2ay - y} = \frac{1 - a}{a}. \]

On the other hand, one has

\[ e_{1,\{2\}}(p_c) = ay, \]

corresponding to player 2's defection. Therefore,

\[ v_1(p_c) = v_2(p_c) = 2ay \cdot \frac{2a - 1}{a} + ay \cdot \frac{1 - a}{a} = (3a - 1)y. \]

On the other hand, the selfish coalition $p_s = (\{1\}, \{2\})$ has value $y$, corresponding to
the equilibrium $(0, 0)$. Consequently, the function $v(p_c) - v(p_s) = (3a - 2)y$ is strictly increasing in
the variable $a$ and one has $v(p_c) - v(p_s) = -\frac{y}{2} < 0$ for $a = \frac{1}{2}$ and
$v(p_c) - v(p_s) = y > 0$ for $a = 1$. □

As a quantitative comparison, we consider the experimental data reported in [GHL02],
with $a = 0.8$. We normalize $y$ to be equal to 1 (in the experiment $y = 0.04$ dollars). In
this case the cooperative equilibrium is supported between 0.66 and 0.67. In [GHL02] it
has been reported that the average of contributions was 0.50, but the mode was 0.60 (6
out of 32 times) followed by 0.80 (5 out of 32 times).
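The interval quoted above can be recovered from the two-player formulas of the proof. The sketch below assumes $N = 2$ and $\frac{1}{2} \le a \le 1$, and looks for the smallest symmetric contribution $x$ whose payoff $y - x + 2ax$ reaches $v(p_c)$; the function name is ours.

```python
def public_goods_two_players(a, y=1.0):
    """Return (v(p_c), v(p_s), smallest symmetric contribution) for N = 2."""
    tau = (1 - a) / a                            # prior probability that the other player defects
    v_c = 2 * a * y * (1 - tau) + a * y * tau    # equals (3a - 1) * y
    v_s = y                                      # nobody contributes
    if v_c <= v_s:
        return v_c, v_s, 0.0                     # the selfish structure prevails
    # a symmetric contribution x yields y - x + 2*a*x; solve for payoff = v_c
    x = (v_c - y) / (2 * a - 1)
    return v_c, v_s, x

v_c, v_s, x = public_goods_two_players(0.8)
print(f"v(p_c)={v_c:.2f}, v(p_s)={v_s:.2f}, contribution={x:.3f}")
```

With $a = 0.8$ and $y = 1$ the threshold contribution falls between 0.66 and 0.67.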

Example 5.9. We consider the finite version of Nash's bargaining problem as in
Example 4.4. It is well known that the unique reasonable solution is (50, 50) and indeed a
number of theories have been developed to select such a Nash equilibrium. For instance,
in [Na50b], [Ka-Sm75], and [Ka77], the authors studied a set of additional axioms that
guarantee that the unique solution of Nash's bargaining problem is a 50-50 share. Other
solutions, based on different solution concepts, have been recently proposed in [Ha-Ro10]
and [Ha-Pa12].

Now we show that the cooperative equilibrium also predicts a 50-50 share, if the two
players have the same perception of gains.

Proposition 5.10. If the two players have the same perception of money, that is,
$f_1 = f_2$, then the unique exact cooperative equilibrium is (50, 50).

Proof. As we have already seen in Example 4.4, the cooperative partition $p_c$ has a unique
acceptable profile of strategies, which is (50, 50). Observe that $\mathrm{Dev}_j(p_c) = \emptyset$, for all $j$,
and therefore $\mathrm{Ind}(\mathcal{G}, p_c)$ is the game where both players can choose only the strategy 50.
Consequently, we have

\[ v_1(p_c) = v_2(p_c) = 50. \]

Now consider the selfish coalition structure $p_s = (\{1\}, \{2\})$. This time the unique
acceptable equilibria are

\[ \mathrm{Acc}_{\{1\}}(\mathcal{G}_{p_s}) = (100, 0), \qquad \mathrm{Acc}_{\{2\}}(\mathcal{G}_{p_s}) = (0, 100). \]

Observing that $\mathrm{Dev}(p_s) = \emptyset$, we then obtain

\[ v_1(p_s) = g_1(100, 100) = 0. \]

Analogously, we obtain $v_2(p_s) = g_2(100, 100) = 0$. Therefore the value of the
cooperative coalition structure is larger than the value of the selfish coalition structure and,
consequently, the set of exact cooperative equilibria of Nash's bargaining problem
coincides with the set of Nash equilibria of the induced game $\mathrm{Ind}(\mathcal{G}, p_c)$. Since this induced
game contains only one profile of strategies, which is (50, 50), this is then its unique
exact cooperative equilibrium. □

We mentioned in the Introduction that there are other solution concepts that have
been proposed in the last few years and we have discussed why we believe that
Renou-Schlag-Halpern-Pass's iterated regret minimization is the most promising of them: the
others are either too rigid or inapplicable to one-shot games. By contrast, iterated
regret minimization can explain deviations from Nash equilibria in several games.
Nevertheless, as observed in [Ha-Pa12], it fails to predict human behavior for some other
games, such as the Prisoner's dilemma, the public good game, and the Traveler's
dilemma with punishment. We have already computed the cooperative equilibrium
for the Prisoner's dilemma and the public good game and we now draw a comparison
between iterated regret minimization and cooperative equilibrium for the Traveler's
dilemma with punishment.

Example 5.11. Consider a variant of the Traveler's dilemma that has been proposed
in [Ha-Pa12], Section 6. Let us start from the Traveler's dilemma in Example 4.6 where,
this time, the strategy set is $\{2, 3, \ldots, 100\}$ for both players and the bonus-penalty is
$b = 2$. Suppose that we modify this variant of the Traveler's dilemma so as to allow
a new action, called $P$ (for punish), where both players get 2 if they both play $P$, but
if one player plays $P$ and the other plays an action other than $P$, then the player who
plays $P$ gets 2 and the other player gets $-96$. In this case $(P, P)$ is a Nash equilibrium
and it is also the solution in terms of regret minimization. As observed in [Ha-Pa12],
this is a quite unreasonable solution, since intuition suggests that playing $P$ should
not be rational. In fact, one can easily check that, from our point of view, this game
is exactly the same as the original Traveler's dilemma (basically because strategies with
very small payoff, such as $P$, do not enter in our computation of the value of the
cooperative coalition) and therefore it has the same cooperative equilibria.



6. Towards cumulative prospect theory

In the previous section we have discussed a set of examples where the cooperative
equilibrium under expected utility theory predicts human behavior satisfactorily well.
On the other hand, since we are working with gain functions, it is natural to use
cumulative prospect theory instead of expected utility theory. But before describing the
cooperative equilibrium under cumulative prospect theory, we discuss a few examples
where the passage from expected utility theory to cumulative prospect theory may
explain observations that are not consistent with the cooperative equilibrium under
expected utility theory.

Example 6.1. We mentioned before that it has been observed that contributions in the
Public Goods game depend on the number of players in a puzzling way: they first
decrease as the number of players increases, but then, when the number of players is
sufficiently large, they increase again. This behavior is not predicted by the cooperative
equilibrium under expected utility theory, which predicts that contributions decrease
as the number of players increases. Nevertheless, this behavior is consistent with the
cooperative equilibrium under cumulative prospect theory.

Indeed, given the $N$-player Public Goods game with marginal return $a$, the prior
probability that player $j$ abandons the coalition is

\[ \frac{1 - a}{a(N - 1)}. \]

Consequently, when $N$ is large enough, all the events "$j$ abandons the coalition" have
negligible probability. Now, one of the principles of cumulative prospect theory is that
decision makers treat extremely unlikely events as impossible (see [Ka-Tv79], p. 275)
and therefore, apart from very risk averse people, most of the agents would actually
replace this probability just by 0. So the cooperative equilibrium is consistent with the
tendency to cooperate that has been observed in large groups.

Example 6.2. The following game has been proposed by J. Halpern in a private
communication. Two players have the same strategy set $\{a, b, c\}$ and the gains are described
by the matrix

        a      b      c
a      x,x    0,0    0,y
b      0,0    x,x    0,y
c      y,0    y,0    y,y

where $x > y > 0$. In this case one finds $v(p_c) = v(p_s)$ and, consequently, the set
of exact cooperative equilibria is equal to the set of Nash equilibria. Nevertheless,
in this case it is very likely that if $y$ and $x$ are very close and much larger than 0,
then the two players should coordinate and play the safe strategy $c$. This behavior
would also be predicted by the cooperative equilibrium under cumulative prospect theory:
the strategies $a$ and $b$ are deleted a priori since they are perceived as too risky with respect to the
safe strategy.



7. A brief introduction to cumulative prospect theory

The examples described in the previous sections give one more motivation to abandon
expected utility theory and use cumulative prospect theory. Before starting the
description of the cooperative equilibrium under cumulative prospect theory, we take
this short section to give a brief introduction to this theory.

By definition, a prospect $p = (x_{-m}, p_{-m}; \ldots; x_{-1}, p_{-1}; x_0, p_0; x_1, p_1; \ldots; x_n, p_n)$ yields
outcomes $x_{-m} < \ldots < x_{-1} < x_0 = 0 < x_1 < \ldots < x_n$ with probabilities $p_i > 0$, for
$i \ne 0$, and $p_0 \ge 0$, that sum up to 1. (Prospect theory and cumulative prospect theory were
originally developed for monetary outcomes, see [Ka-Tv79], p. 274, l. 4, giving us one more
motivation to abandon utility functions and work with gain functions. Kahneman and Tversky's
choice to work with monetary outcomes is probably due to the second principle of their theory,
as will be recalled a little later.)

Expected utility theory was founded by Morgenstern and von Neumann in [Mo-vN47]
to predict the behavior of a decision maker who must choose one prospect among several.
Under certain axioms (see, for instance, [Fi82]) Morgenstern and von Neumann proved
that a decision maker would evaluate each prospect $p$ using the value

\[ V(p) = \sum_{i=-m}^{n} p_i u(x_i), \]

where $u(x_i)$ is the utility of the outcome $x_i$, and then she would choose the prospect(s)
maximizing $V(p)$.

It was first realized by M. Allais in [Al53] that a human decision maker does
not really follow the axioms of expected utility theory and, in particular, she evaluates
a prospect using an evaluation procedure different from the one above. A first attempt
to replace expected utility theory with a theory founded on different axioms and able to
explain deviations from rationality was made in [Ka-Tv79], where Kahneman and Tversky
founded the so-called prospect theory. This novel theory encountered two problems.
First, it did not always satisfy stochastic dominance, an assumption that many theorists
were reluctant to give up. Second, it was not readily extendable to prospects with a
large number of outcomes. Both problems could be solved by the rank-dependent or
cumulative functional, first proposed by Quiggin [Qu82] for decision under risk and by
Schmeidler [Sc89] for decision under uncertainty. Finally, Kahneman and Tversky were
able to incorporate the ideas presented in [Qu82] and [Sc89] and developed their
cumulative prospect theory in [Tv-Ka92]. Prospect theory and cumulative prospect theory
have been successfully applied to explain a large number of phenomena that expected
utility theory was not able to explain, such as the disposition effect [Sh-St85], asymmetric
price elasticity [Pu92], [Ha-Jo-Fa93], tax evasion [Dh-No07], as well as many problems
in international relations [Le92], finance [Th05], political science [Le03], among many
others. (The two papers on prospect theory and cumulative prospect theory have more than
30000 citations.)

The basic principles of cumulative prospect theory are the following.

(P1) Decision makers weight probabilities in a non-linear manner. In particular,
the evidence suggests that decision makers overweight low probabilities and
underweight high probabilities.

(P2) Decision makers think in terms of gains and losses rather than in terms of their
net assets. (This principle is probably the one which forced Kahneman and Tversky to
work with monetary outcomes and forces us to work with gain functions.)

(P3) Decision makers tend to be risk-averse with respect to gains and risk-acceptant
with respect to losses. (As a consequence, risk aversion is already taken into account
and this is why we did not need to consider it explicitly in the definition of a game in
explicit form.)

(P4) Losses loom larger than gains; namely, the aggravation that one experiences in
losing a sum of money appears greater than the pleasure associated with gaining
the same amount of money.

The consequence of these principles is that decision makers evaluate a prospect $p$
using a value function

\[ V(p) = \sum_{j=-m}^{n} \pi_j v(x_j), \]

that is completely different from the one of expected utility theory. Understanding the explicit shape of
the functions $v$ and $\pi$ is probably the most important problem in cumulative prospect
theory. As for the function $v$, it was originally proposed in [Tv-Ka92] to use the
function

\[ v(x) = \begin{cases} x^{\alpha}, & \text{if } x \ge 0; \\ -\lambda(-x)^{\beta}, & \text{if } x < 0, \end{cases} \]

where experiments done in [Tv-Ka92] gave the estimates $\alpha \approx \beta \approx 0.88$ and $\lambda \approx 2.25$.
As for the function $\pi$, the situation is much more intricate: cumulative prospect theory
postulates the existence of a strictly increasing surjective function $w : [0, 1] \to [0, 1]$ such
that

\[ \begin{aligned}
\pi_{-m} &= w(p_{-m}) \\
\pi_{-m+1} &= w(p_{-m} + p_{-m+1}) - w(p_{-m}) \\
&\;\;\vdots \\
\pi_j &= w\Big(\sum_{i=-m}^{j} p_i\Big) - w\Big(\sum_{i=-m}^{j-1} p_i\Big), \qquad j < 0 \\
\pi_j &= w\Big(\sum_{i=j}^{n} p_i\Big) - w\Big(\sum_{i=j+1}^{n} p_i\Big), \qquad j > 0 \\
&\;\;\vdots \\
\pi_{n-1} &= w(p_{n-1} + p_n) - w(p_n) \\
\pi_n &= w(p_n)
\end{aligned} \]

A first proposal of such a function $w$ was made by Tversky and Kahneman themselves
in [Tv-Ka92] and it is

\[ w(p) = \frac{p^{\gamma}}{\left(p^{\gamma} + (1 - p)^{\gamma}\right)^{1/\gamma}}, \]

where the range of $\gamma$ has been estimated in [Ri-Wa06]. Other
functions $w$ have been proposed in [Ka79], [Go-Ei87], [Ro87], [Cu-Sa89], [La-Ba-Wi92],
[Lu-Me-Ch93], [He-Or94], [Pr98], and [Sa-Se98].
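To make the evaluation procedure concrete, the following Python sketch computes the value of a prospect with the value function and the weighting function of [Tv-Ka92] recalled above. It is only an illustration: the default $\gamma = 0.61$ is Tversky and Kahneman's 1992 estimate for gains and is our assumption here (the text does not fix a value), a single $w$ is used for gains and losses as in the description above, and all function names are ours.

```python
def value(x, alpha=0.88, beta=0.88, lam=2.25):
    """Value of an outcome: concave for gains, convex and steeper for losses."""
    return x ** alpha if x >= 0 else -lam * (-x) ** beta

def weight(p, gamma=0.61):
    """Probability weighting function w; gamma=0.61 is an assumed default."""
    p = min(max(p, 0.0), 1.0)  # guard against rounding slightly outside [0, 1]
    return p ** gamma / (p ** gamma + (1 - p) ** gamma) ** (1 / gamma)

def decision_weights(outcomes, probs, gamma=0.61):
    """Decision weights pi_j; outcomes must be sorted in increasing order."""
    pis = []
    for j, x in enumerate(outcomes):
        if x < 0:    # losses: cumulate probabilities from the worst outcome
            pis.append(weight(sum(probs[: j + 1]), gamma) - weight(sum(probs[:j]), gamma))
        elif x > 0:  # gains: cumulate probabilities from the best outcome
            pis.append(weight(sum(probs[j:]), gamma) - weight(sum(probs[j + 1:]), gamma))
        else:        # v(0) = 0, so the weight of the reference outcome is irrelevant
            pis.append(0.0)
    return pis

def cpt_value(outcomes, probs, gamma=0.61):
    """Value V(p) of a prospect as the weighted sum of outcome values."""
    pis = decision_weights(outcomes, probs, gamma)
    return sum(pi * value(x) for pi, x in zip(pis, outcomes))

# A fair bet between losing and winning 100 has negative value (loss aversion).
print(cpt_value([-100, 0, 100], [0.5, 0.0, 0.5]))
```

With these parameters low probabilities are overweighted and high probabilities underweighted, as stated in principle (P1).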

It is not our purpose to give too many details about the enormous literature devoted 
to understanding the evaluation procedure in cumulative prospect theory. Our purposes 
were indeed to give a brief introduction to the theory and stress how this theory implies 
the necessity to work with gain functions instead of utility functions. So we now pass 
to the description of the cooperative equilibrium for finite games in explicit form under 
cumulative prospect theory and taking into account altruism. 



8. Iterated Deletion: the set of playable strategies 

The cooperative equilibrium under cumulative prospect theory and taking into
account altruism will be defined through two steps. In the first step we use the altruism
functions $a_{ij}$ to eliminate the strategies that are not good for the collectivity. The
second step is the prospect theoretical analogue of the procedure described in Section 4,
applied to the subgame obtained after eliminating the strategies in the first step.

In this section we describe the first step of the construction, that we call iterated
deletion. As is well known, iterated deletion of strategies is a procedure which is common
to most solution concepts (in Nash theory, one deletes dominated strategies; in
iterated regret minimization theory, one deletes strategies which do not minimize regret;
in Bernheim's and Pearce's rationalizability theory ([Be84] and [Pe84]), one deletes
strategies that are not justifiable [Os-Ru94]). However, the use of altruism to delete strategies
seems new in the literature. This iterated deletion of strategies is based on a new notion
of domination between strategies, that we call super-domination, which is motivated
by the fact that human players do not eliminate weakly or strongly dominated
strategies (as shown by the failure of the classical theory to predict human behavior in the
Prisoner's and Traveler's dilemmas). (A slightly stronger notion of domination between
strategies has been independently introduced in [Ha-Pa13], under the name minimax
domination.)

Each step of our iterated deletion of strategies consists of two sub-reductions. The
first sub-reduction is based on the following principle:

(CS) If $s_i \in S_i$ is a strategy for which there is another strategy $s'_i \in S_i$ which gives a
certain larger gain (or a certain smaller loss) to player $i$ and does not harm too
much the other players, then player $i$ will prefer the strategy $s'_i$ and will never
play the strategy $s_i$.

Thus, this principle states that every player is selfish unless society suffers a big
damage. As we mentioned before, implicit in this principle there is a new notion of
domination between strategies.
Definition 8.1. Let $s_i, s'_i \in S_i$. We say that $s_i$ is super-dominated by $s'_i$, and we write
$s_i <_i s'_i$, if

(1) for all $s_{-i}, s'_{-i} \in S_{-i}$, one has $g_i(s_i, s_{-i}) \le g_i(s'_i, s'_{-i})$,

(2) there are $s_{-i}, s'_{-i} \in S_{-i}$ such that $g_i(s_i, s_{-i}) < g_i(s'_i, s'_{-i})$.

Observe that super-domination is much stronger than the classical notion of weak
domination. This makes sense since it has been observed that in many situations, as in
the Traveler's dilemma, players do not eliminate weakly dominated strategies, while it is
clear that a purely selfish player would delete a super-dominated strategy. On the other
hand, there is no direct relation between super-domination and strong domination, as
shown by the following examples.

Example 8.2. Consider the following version of the Prisoner's dilemma

        L      R
U      2,2    0,3
D      3,0    1,1

The strategy $D$ strongly dominates $U$ and the strategy $R$ strongly dominates the strategy
$L$. Nevertheless, there are no super-dominated strategies, since $g_1(D, R) < g_1(U, L)$ and
$g_2(D, R) < g_2(U, L)$.

Example 8.3. Consider the two-person zero-sum game

        L      R
U      0,0    10,-10
D      1,-1   1,-1

In this case $L$ super-dominates $R$, but $R$ is not strongly dominated by $L$, since $g_2(D, R) =
g_2(D, L)$.
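The two examples can be checked mechanically. In the sketch below a strategy is super-dominated when each of its possible gains is at most every possible gain of the other strategy, with at least one strict inequality, while strong domination compares gains against the same opponent profile. The dictionaries and function names are our own encoding of the two matrices above.

```python
def super_dominated(gains, s, t):
    """True if strategy s is super-dominated by t.

    gains[k] lists the player's gains from strategy k, one entry per
    profile of the opponents (same order for every strategy).
    """
    return max(gains[s]) <= min(gains[t]) and min(gains[s]) < max(gains[t])

def strongly_dominated(gains, s, t):
    """True if t gives a strictly larger gain than s against every opponent profile."""
    return all(gs < gt for gs, gt in zip(gains[s], gains[t]))

# Example 8.2, row player: gains against L and R
pd_row = {"U": [2, 0], "D": [3, 1]}
# Example 8.3, column player: gains against U and D
zs_col = {"L": [0, -1], "R": [-10, -1]}

print(strongly_dominated(pd_row, "U", "D"), super_dominated(pd_row, "U", "D"))
print(strongly_dominated(zs_col, "R", "L"), super_dominated(zs_col, "R", "L"))
```

The first line shows strong domination without super-domination, the second the converse, which is why the two notions are not comparable.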

We will see in Example 8.13 that the notion of super-domination between strategies
can be interesting in itself, since it allows one to explain some phenomena that are not easy
to capture making use of weakly and strongly dominated strategies.

Before coming back to the theory, we need to fix some terminology. Fix $\sigma_i \in \mathcal{P}(S_i)$;
the fiber game defined by $\sigma_i$ is the $(N - 1)$-player game $\mathcal{G}_{\sigma_i}$ obtained from $\mathcal{G}$ assuming
that player $i$ plays the strategy $\sigma_i$ surely. Formally, $\mathcal{G}_{\sigma_i} = \mathcal{G}(P \setminus \{i\}, S_{-i}, g_{\sigma_i}, a_{-i}, f_{-i})$,
where $g_{\sigma_i}$ is the $(N - 1)$-dimensional vector whose components are the functions $g_j(\sigma_i, \cdot)$,
with $j \in P \setminus \{i\}$, $a_{-i} = (a_{jk})_{j,k \in P \setminus \{i\}, j \ne k}$, and $f_{-i} = (f_j)_{j \in P \setminus \{i\}}$. Using a trick which is
conceptually similar to the one used in [Ha-Ro10], we define the cooperative equilibrium
by induction on the number of players.

Definition 8.4. The cooperative equilibria of a one-player game are all probability
measures supported on the set of pure strategies that maximize the gain function and
give rise to acceptable equilibria. (As observed in Remark 4.7, when there are many such
equilibria, it might make sense to consider only the barycenter.)

Now we suppose that we have already defined the cooperative equilibrium for all
$(N - 1)$-player games and we define the cooperative equilibrium for all $N$-player games.
We denote by $\mathrm{Coop}(\mathcal{G})$ the set of cooperative equilibria of a game $\mathcal{G}$.

Now, fix $i \in P$ and let $s, t \in S_i$, with $s <_i t$. If player $i$ is believed to play the strategy
$t$, the other players would answer playing an equilibrium of the fiber game $\mathcal{G}_t$. Since the
fiber game has $N - 1$ players, we may use the inductive hypothesis. We define the set
of losers $L_i(s, t)$ to be the set of players $j \in P$ such that

(1) $g_j(s, \sigma_{-i}^{(s)}) > g_j(t, \sigma_{-i}^{(t)})$, for all $\sigma_{-i}^{(t)} \in \mathrm{Coop}(\mathcal{G}_t)$, $\sigma_{-i}^{(s)} \in \mathrm{Coop}(\mathcal{G}_s)$, and

(2) $g_j(t, \sigma_{-i}^{(t)}) < g_i(t, \sigma_{-i}^{(t)})$, for all $\sigma_{-i}^{(t)} \in \mathrm{Coop}(\mathcal{G}_t)$.

In words, $L_i(s, t)$ is the set of players that have a certain disadvantage when player $i$
decides to play the strategy $t$ instead of her worse strategy $s$ (Condition (1)) and that
are weaker than player $i$ when she plays her better strategy $t$ (Condition (2)).

Now, if player $i$ renounces playing $t$ and accepts playing $s$, then she renounces 
a certain gain of $\inf\{ g_i(t, \sigma_{-i}^{(t)}) : \sigma_{-i}^{(t)} \in \mathrm{Coop}(\mathcal{G}_t) \}$ to accept a smaller gain. Her 
maximal loss is then: 

$$P_i(s,t) := \sup\left\{ \inf\{ g_i(t, \sigma_{-i}^{(t)}) : \sigma_{-i}^{(t)} \in \mathrm{Coop}(\mathcal{G}_t) \} - g_i(s, \sigma_{-i}^{(s)}) : \sigma_{-i}^{(s)} \in \mathrm{Coop}(\mathcal{G}_s) \right\}.$$

On the other hand, the best that can happen to player $j \in L_i(s,t)$ if player $i$ decides 
to play her worse strategy $s$ is 

$$Q_j(s,t) := \sup\left\{ g_j(s, \sigma_{-i}^{(s)}) - \inf\{ g_j(t, \sigma_{-i}^{(t)}) : \sigma_{-i}^{(t)} \in \mathrm{Coop}(\mathcal{G}_t) \} : \sigma_{-i}^{(s)} \in \mathrm{Coop}(\mathcal{G}_s) \right\}.$$

Now, set 

$$P'_i(t) := \inf\{ g_i(t, \sigma_{-i}^{(t)}) : \sigma_{-i}^{(t)} \in \mathrm{Coop}(\mathcal{G}_t) \}.$$

In words, this number is the certain gain that player $i$ would get if she decides to play 
her better strategy $t$. Now, set 

$$Q'_j(s) := \inf\{ g_j(s, \sigma_{-i}^{(s)}) : \sigma_{-i}^{(s)} \in \mathrm{Coop}(\mathcal{G}_s) \}.$$

In words, this number is the certain gain that player $j$ would get if player $i$ decides to 
play her worse strategy $s$. 

Therefore, we have reduced the problem of choosing $s$ or $t$ to the following problem: 
does player $i$ accept to renounce $P_i(s,t)$ out of $P'_i(t)$ in order to give a gain of $Q_j(s,t)$ 
to player $j$, who already had a certain gain of $Q'_j(s)$? This is in fact a generalized 
dictator game. So, we set 

$$A_{ij}(s,t) = a_{ij}\left(Q_j(s,t), P'_i(t), Q'_j(s)\right),$$

and we give the following definition. 

Definition 8.5. A strategy $s \in S_i$ is unplayable of the first type for player $i$ if there is 
another strategy $t \in S_i$ such that 

• $s <_i t$, 

• for all $j \in L_i(s,t)$, one has $P_i(s,t) > A_{ij}(s,t)$. 

In this case we write $s <_i^{(1)} t$. 

Example 8.6. Consider the game with gain matrix 

      L      R 
U    1,1    1,1 
D    1,1    2,1 

Observe that $U <_1 D$. Moreover, $L_1(U,D) = \emptyset$ and therefore the second condition in 
Definition 8.5 is true for trivial reasons. Consequently, the strategy U is unplayable of 
the first type for the first player. This happens, roughly speaking, because the vertical 
player, playing D, can have a gain without damaging the horizontal player. 

Example 8.7. A slightly less trivial example is given by the game represented by the 
following gain matrix 

      L       R 
U    0,0     0,0 
D    1,-1    1,-1 

Assume $a_{12}(1,1,-1) < 1$. Of course, $U <_1 D$. Now, observe that $P_1(U,D) = 1$ and 
that $A_{12}(U,D) = a_{12}(1,1,-1)$, thus the strategy U is unplayable of the first type for the 
vertical player. Roughly speaking, this happens because the vertical player, playing D, 
will get a certain gain while causing a damage to the horizontal player that is small compared 
to her gain. 
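The quantities used in Examples 8.6 and 8.7 can be checked numerically. The following is a minimal sketch, restricted to two-player games and assuming that the cooperative equilibria of a one-player fiber game can be represented by the pure maximizers of the remaining player's gain (the acceptability requirement of Definition 8.4 is ignored, and infima and suprema over mixtures of maximizers are attained at pure strategies under expected utility). The function names `maximizers` and `compare` are hypothetical and only serve the illustration.

```python
def maximizers(gains):
    """Return the keys of a dict of gains that attain the maximum."""
    best = max(gains.values())
    return [k for k, v in gains.items() if v == best]


def compare(g_i, g_j, s, t):
    """Quantities attached to a pair (s, t) of strategies of player i.

    g_i[x][y] and g_j[x][y] are the gains of players i and j when
    player i plays x and player j plays y (two-player games only).
    """
    coop_s = maximizers(g_j[s])  # assumed Coop of the fiber game G_s
    coop_t = maximizers(g_j[t])  # assumed Coop of the fiber game G_t
    certain_gain_t = min(g_i[t][y] for y in coop_t)            # P'_i(t)
    p_loss = max(certain_gain_t - g_i[s][y] for y in coop_s)   # P_i(s, t)
    q_gain = (max(g_j[s][y] for y in coop_s)
              - min(g_j[t][y] for y in coop_t))                # Q_j(s, t)
    # Conditions (1) and (2) in the definition of the set of losers.
    cond1 = all(g_j[s][ys] > g_j[t][yt] for ys in coop_s for yt in coop_t)
    cond2 = all(g_j[t][yt] < g_i[t][yt] for yt in coop_t)
    return {"P": p_loss, "Q": q_gain, "j_in_L": cond1 and cond2}


# Example 8.6: gains of the vertical (g1) and horizontal (g2) player.
g1_86 = {"U": {"L": 1, "R": 1}, "D": {"L": 1, "R": 2}}
g2_86 = {"U": {"L": 1, "R": 1}, "D": {"L": 1, "R": 1}}
ex86 = compare(g1_86, g2_86, "U", "D")

# Example 8.7.
g1_87 = {"U": {"L": 0, "R": 0}, "D": {"L": 1, "R": 1}}
g2_87 = {"U": {"L": 0, "R": 0}, "D": {"L": -1, "R": -1}}
ex87 = compare(g1_87, g2_87, "U", "D")

print(ex86)
print(ex87)
```

Under these assumptions the horizontal player is not a loser in Example 8.6, whereas in Example 8.7 she is a loser and the maximal loss of the vertical player equals 1, in agreement with the text.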

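Coming back to the theory, we would like to delete unplayable strategies of the first 
type. To this end, we need to prove a simple lemma. Given $s_i \in S_i$, let $\mathrm{Maj}_i^{(1)}(s_i) = \{ t \in S_i : s_i <_i^{(1)} t \}$. 

Lemma 8.8. For all $i \in P$, there exists $s_i \in S_i$ such that $\mathrm{Maj}_i^{(1)}(s_i) = \emptyset$. 

Proof. By contradiction, let $\mathrm{Maj}_i^{(1)}(s_i) \neq \emptyset$, for all $s_i \in S_i$. Fix $s_i^{(1)} \in S_i$. An iteration of 
the property $\mathrm{Maj}_i^{(1)} \neq \emptyset$ allows one to construct a chain 

$$s_i^{(1)} <_i^{(1)} s_i^{(2)} <_i^{(1)} s_i^{(3)} <_i^{(1)} \cdots$$

By finiteness of the set $S_i$, we may assume that at some point we get $s_i^{(k)} = s_i^{(h)}$ for some $h < k$, with 
$s_i^{(h+1)} \neq s_i^{(h)}$. Observe that the relation $<_i^{(1)}$ might not be transitive, but the underlying 
relation $<_i$ is transitive. Therefore, we have gotten 

$$s_i^{(h)} <_i s_i^{(h+1)} \quad \text{and} \quad s_i^{(h+1)} <_i s_i^{(h)},$$

which contradict each other. □ 

Let $\mathrm{UnPl}_i^{(1)}(\mathcal{G})$ be the set of player $i$'s unplayable strategies of the first type and 
denote by $\mathrm{Pl}_i^{(1)}(\mathcal{G}) := S_i \setminus \mathrm{UnPl}_i^{(1)}(\mathcal{G})$ the complementary set, which is well defined and non-empty by Lemma 8.8. 
The notation $\mathrm{Pl}_{-i}^{(1)}(\mathcal{G})$ stands for the cartesian product of all the $\mathrm{Pl}_j^{(1)}(\mathcal{G})$'s but $\mathrm{Pl}_i^{(1)}(\mathcal{G})$. 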

Now we start the description of the second sub-restriction, which will be done through 
the definition of unplayable strategies of the second type. The principle underlying 
this second restriction is in some sense dual to the one underlying the previous 
restriction: 

(PA) If $s \in S_i$ is a strategy for which there is another strategy $t \in S_i$ such that player 
$i$ has a little disadvantage, but the other players have a big advantage, then 
player $i$ will prefer the strategy $t$ in order to help the society. 

As said earlier, the principle (CS) is a sort of controlled selfishness, whereas the 
principle (PA) sounds more like pure altruism. We can formalize it in a similar way as 
we formalized (CS). Indeed, we can use the numbers $P_i(s,t)$ and $A_{ij}(s,t)$ in the dual 
way. 
Definition 8.9. A strategy $t \in \mathrm{Pl}_i^{(1)}(\mathcal{G})$ is called unplayable of the second type for player 
$i$ if there is another strategy $s \in \mathrm{Pl}_i^{(1)}(\mathcal{G})$ such that 

(1) $s <_i t$, 

(2) there exists $j \in L_i(s,t)$ such that $P_i(s,t) < A_{ij}(s,t)$. 

Example 8.10. Consider the standard dictator game Dict(1, 10, 0), that is, a proposer 
offers a division of 10 dollars, which the responder has to accept. The standard perfect 
equilibrium analysis of this game is that the proposer should keep all the money, 
since the responder has no say. Nevertheless, it has been reported in experiments that 
most proposers offer a certain amount of money to the responder (see, for instance, 
[Fo-Ho-Sa-Se94]). Bolton and Ockenfels explained this anomalous behavior using equity 
in [Bo-Oc00]. We can explain it using iterated deletion of strategies using altruism. Let 
us model the set of strategies of the proposer, for simplicity, by $S = \{0, 1, \ldots, 10\}$. It 
is clear that there is a chain of super-dominated strategies for the proposer: 
$0 <_{prop} 1 <_{prop} 2 <_{prop} \cdots <_{prop} 10$. Now, one can easily show that every strategy $s$ with 
$s < a_{prop,resp}(1,10,0)$ is unplayable of the second type for the proposer. Therefore, 
cooperative equilibrium theory predicts that the proposer offers a fairer division because 
of altruism. Moreover, the larger $a_{prop,resp}(1,10,0)$ is, the larger the offer is. 

Example 8.11. We have seen in Example 5.5 that the cooperative equilibrium without 
altruism of the Ultimatum game is that the proposer offers 0.25 and the responder accepts. 
Nevertheless, it has been reported that most proposers actually propose a share 
closer to 0.5. This can be explained by taking altruism into account. Indeed, if we model 
the set of strategies of the proposer using the set $S = \{0.00, 0.01, 0.02, \ldots, 1.00\}$, then 
in the induced game $\mathrm{Ind}(\mathcal{G}, p_c)$, the strategy 0.25 is super-dominated for the proposer 
by 0.26, which is super-dominated by 0.27 and so forth. As in the previous example, 
some of these strategies are unplayable of the second type and therefore altruism can 
explain why offers are typically larger than 0.25. 

Let $\mathrm{UnPl}_i^{(2)}(\mathcal{G})$ be the set of player $i$'s unplayable strategies of the second type and 
denote by $\mathrm{Pl}_i^{(2)}(\mathcal{G}) := \mathrm{Pl}_i^{(1)}(\mathcal{G}) \setminus \mathrm{UnPl}_i^{(2)}(\mathcal{G})$. This set is well defined and non-empty thanks 
to the obvious analogue of Lemma 8.8. 

Now, we start an iteration of this procedure: we consider the subgame $\mathcal{G}_2$ of $\mathcal{G}$ 
defined by the strategy sets $\mathrm{Pl}_i^{(2)}(\mathcal{G})$ and we reduce again these strategy sets by computing 
the unplayable strategies of the two types; in this way, we get other sets of playable 
strategies $\mathrm{Pl}_i^{(2)}(\mathcal{G}_2)$, and we start the procedure again. By finiteness of the strategy sets 
$S_i$, this iteration stabilizes, that is, at some step $k$, one has $\mathrm{Pl}_i^{(2)}(\mathcal{G}_k) = \mathrm{Pl}_i^{(2)}(\mathcal{G}_{k+1})$ 
and this set is clearly non-empty. We set $\mathrm{Pl}_i := \mathrm{Pl}_i^{(2)}(\mathcal{G}_k)$. 

Definition 8.12. The set $\mathrm{Pl}_i$ is called the set of playable strategies of player $i$. 
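The iteration leading to the sets of playable strategies is a fixed-point computation, which can be sketched as follows. This is only an illustration under simplifying assumptions: the two deletion tests are passed as hypothetical predicates (`unplayable_first`, `unplayable_second`), both evaluated on the current subgame, and the toy predicates below simply hard-code the deletions described for the game of Example 8.3 rather than computing super-domination or altruism functions.

```python
def playable_strategies(strategy_sets, unplayable_first, unplayable_second):
    """Iterate the two deletions until the strategy sets stabilize.

    strategy_sets: dict player -> iterable of pure strategies.
    unplayable_first / unplayable_second: hypothetical predicates
    (player, strategy, current_sets) -> bool.
    Returns the stable sets and the number of rounds that changed something.
    """
    current = {i: frozenset(s) for i, s in strategy_sets.items()}
    rounds = 0
    while True:
        after_first = {
            i: frozenset(s for s in strategies
                         if not unplayable_first(i, s, current))
            for i, strategies in current.items()
        }
        after_second = {
            i: frozenset(s for s in strategies
                         if not unplayable_second(i, s, current))
            for i, strategies in after_first.items()
        }
        if after_second == current:
            return current, rounds
        current = after_second
        rounds += 1


# Toy predicates (assumptions, not derived from gains): R is deleted for
# player 2, and U is deleted for player 1 once player 2 is left with L only.
def first_type(player, strategy, sets):
    return player == 2 and strategy == "R"


def second_type(player, strategy, sets):
    return player == 1 and strategy == "U" and sets[2] == frozenset({"L"})


playable, rounds = playable_strategies(
    {1: {"U", "D"}, 2: {"L", "R"}}, first_type, second_type
)
print(playable, rounds)
```

With these toy predicates the procedure needs two rounds before it stabilizes, leaving D for the vertical player and L for the horizontal player.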

Before starting the second step of the construction, that is, the prospect theoretical 
analogue of Section 4, we give more details about the game introduced in Example 8.3. 
Indeed, this game seems interesting from several viewpoints. First, it is an example 
where the procedure of elimination of unplayable strategies stabilizes after more than 
one step. Second, it is one more example where iterated regret minimization theory 
fails to predict the intuitively right behavior, whereas the cooperative equilibrium 
apparently does the right job. Finally, it is an example where super-dominated strategies turn 
out to be helpful to modify iterated regret minimization theory, allowing prior beliefs 
and consequently obtaining the right prediction also under iterated regret minimization 
theory. 

Example 8.13. Consider the same two-person zero-sum game as in Example 8.3, that 
is, the game with gain matrix 

      L       R 
U    0,0     10,-10 
D    1,-1    1,-1 

Assume that $a_{12}(1,1,-1) < 1$. Observe that L super-dominates R and that $L_2(R,L) = \emptyset$. 
Consequently, R is unplayable of the first type. On the other hand, in this first step 
U and D are not ordered and therefore, the first step of the iterated deletion leads to the 
subgame $\mathcal{G}_2$ where the vertical player still has both strategies U and D available, whereas 
the horizontal player has only the strategy L. Therefore, in the game $\mathcal{G}_2$, the strategy 
U is unplayable of the second type for the vertical player (since $a_{12}(1,1,-1) < 1$) and, 
consequently, one more application of deletion of unplayable strategies leads to the trivial 
game where the vertical player has only the strategy D and the horizontal player 
has only the strategy L. Therefore, (D,L) is the unique cooperative equilibrium of this 
game. Observe that this is also a Nash equilibrium. The other Nash equilibrium is 
$(D, \frac{9}{10}L + \frac{1}{10}R)$, as one can easily check, which is quite unreasonable, since there is no 
reason why the horizontal player should play R: playing L she will certainly get at least 
the same as playing R. Therefore, the cooperative equilibrium coincides with the most 
reasonable Nash equilibrium. 

On the other hand, a direct application of the iterated regret minimization procedure 
predicts that the vertical player plays U surely. This is also quite unreasonable, because 
playing U makes sense only if the horizontal player plays R. This cannot happen, above 
all if the horizontal player understands that the vertical player is going to play U. As 
suggested by Halpern in a private communication, one can fix this problem by allowing prior 
beliefs, in a conceptually similar way as in [Ha-Pa12], Section 3.5: first one eliminates 
weakly dominated strategies, then applies iterated regret minimization. Nevertheless, 
this procedure is questionable on one point: it is not clear why one should eliminate 
weakly dominated strategies in this context and not in the Traveler's dilemma. One 
can fix this problem using super-domination. If one eliminates super-dominated strategies 
in the game under consideration before applying iterated regret minimization, one 
finds the right solution (D,L), coherently with the classical theory and the cooperative 
equilibrium. Moreover, this is perfectly coherent with the other examples discussed in 
[Ha-Pa12] and in particular with the Traveler's dilemma: the Traveler's dilemma has 
many weakly dominated strategies, but none of them is super-dominated. 

Footnote (on the Traveler's dilemma): If one eliminates weakly dominated strategies in the Traveler's dilemma before applying iterated regret 
minimization, one obtains the Nash equilibrium. 

9. The cooperative equilibrium under cumulative prospect theory 

In this section we finally define the cooperative equilibrium for games in explicit form 
$\mathcal{G} = \mathcal{G}(P, S, Q, g, a, f)$ in complete generality. 

In the previous section we have restricted the sets of pure strategies and we have 
defined the sets of playable strategies $\mathrm{Pl}_i$. We denote by $\mathrm{Red}(\mathcal{G})$ this reduced game, that 
is, the subgame of $\mathcal{G}$ defined by the strategy subsets $\mathrm{Pl}_i$. The cooperative equilibrium of 
$\mathcal{G}$ (under prospect theory and taking into account altruism) will be obtained by applying 
the construction described in Section 4 to the reduced game $\mathrm{Red}(\mathcal{G})$ and making use 
of cumulative prospect theory. To this end, notice that the construction presented in 
Section 4 depends on expected utility theory only on two points: 

(1) We have used expected utility theory to compute the value of the prospect 

$$(e_{i,J}(p), \tau_{i,J}(p))$$

indexed by $J \subseteq P \setminus \{i\}$. Using cumulative prospect theory, the value that we 
denoted $v_i(p)$ should be replaced by its prospect theoretical analogue 

$$v_i^{CPT}(p) = \sum_{J \subseteq P \setminus \{i\}} v(e_{i,J}(p))\, \pi_{i,J}(p). \qquad (13)$$

Since the value $v(x)$ represents how the players perceive a gain of $x$, also the 
definition of the induced game should be modified: indeed we should allow only 
the profiles of strategies $\sigma$ such that $v(g_i(\sigma)) \geq v_i^{CPT}(p)$. Consequently, the 
two applications of the function $v$, the first in the computation of $v_i^{CPT}$ and the 
second in the definition of the induced game, are somehow inverse. Indeed, if 
$v$ were linear and increasing, the induced game would have been the same as 
the one obtained by setting $v(x) = x$. Now, we know from cumulative prospect 
theory that $v$ is strictly increasing. Approximating it by a linear function, we can 
considerably simplify the definition by setting $v(x) = x$. This explains why the examples 
in Section 6 fit the experimental data very well: the experiments were conducted 
with relatively small monetary outcomes and there were no possible losses. Of 
course, it is predictable that in case of possible large gains and/or losses, this 
approximation will create problems. 

(2) The definition of the value of a coalition and then the definition of the cooperative 
equilibrium rely on the computation of Nash equilibria of the games $\mathcal{G}_p$ 
and $\mathrm{Ind}(\mathcal{G},p)$. The computation of Nash equilibria uses expected utility theory, 
precisely in the definition of the mixed extension of the gain functions. Unfortunately, 
the natural translation of Nash equilibrium in the language of cumulative 
prospect theory leads to the definition of an object that might not exist (see [Cr90] and, 
more generally, [Fi-Pa10]). To avoid this problem we consider a solution concept 
which is a bit more general than Nash equilibrium, the so-called equilibrium in 
beliefs, introduced by Crawford in [Cr90]. Crawford's equilibria in beliefs have 
the good property that they exist in our context, contain all Nash equilibria, and reduce 
to Nash equilibria in many cases. The remainder of the section is devoted to 
this. 

Before recalling the definition of an equilibrium in beliefs, we need a preliminary 
step, namely writing the mixed extension of the gain functions in the language of 
cumulative prospect theory. Since notation will get complicated very soon, we start with 
an example. 

Example 9.1. Consider the (already reduced) game with gain matrix: 

      C      D 
C    2,2    0,3 
D    3,0    1,1 

Assume that the column-player (player 1) plays a mixed strategy $\sigma_1 = \sigma_1(C)\,C + \sigma_1(D)\,D$ and 
player 2 plays a mixed strategy $\sigma_2 = \sigma_2(C)\,C + \sigma_2(D)\,D$. Under expected utility theory, we 
would have 

$$g_1(\sigma_1,\sigma_2) = \sum_{x,y \in \{C,D\}} g_1(x,y)\,\sigma_1(x)\,\sigma_2(y).$$

Let us compute this number step by step, to highlight where and how expected 
utility theory must be replaced by cumulative prospect theory. Fix $\sigma_2$ as before and 
observe that we have a finite family of prospects, one for each pure strategy of the first 
player. In this example, they are: 

$$p^{(C,\sigma_2)} = (2, \sigma_2(C);\ 0, \sigma_2(D)) \quad \text{and} \quad p^{(D,\sigma_2)} = (3, \sigma_2(C);\ 1, \sigma_2(D)).$$

Now, under expected utility theory (and this is the first point where expected utility 
theory is used), one computes the values of the two prospects, obtaining, in this 
particular example, the values 

$$V_1(C,\sigma_2) = 2 \cdot \sigma_2(C) + 0 \cdot \sigma_2(D) \quad \text{and} \quad V_1(D,\sigma_2) = 3 \cdot \sigma_2(C) + 1 \cdot \sigma_2(D).$$

Of course, these numbers are equal to the ones that are usually denoted by $g_1(C,\sigma_2)$ 
and $g_1(D,\sigma_2)$, respectively. Now, to compute the value usually denoted by $g_1(\sigma_1,\sigma_2)$, 
one first constructs one more prospect using the measure $\sigma_1$, that is 

$$p^{(\sigma_1,\sigma_2)} = (V_1(C,\sigma_2), \sigma_1(C);\ V_1(D,\sigma_2), \sigma_1(D)),$$

and finally, again under expected utility theory, one computes the value of this prospect, 
obtaining the well known value $g_1(\sigma_1,\sigma_2)$. 
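The two-step computation of Example 9.1 can be reproduced in a few lines. The sketch below works under expected utility theory only, and the numerical mixed strategies `sigma1` and `sigma2` are hypothetical values chosen for illustration; the point is that evaluating the two nested prospects gives the same number as the direct double sum.

```python
from fractions import Fraction

# Gains of player 1 in the game of Example 9.1: g1[(own, other)].
g1 = {("C", "C"): 2, ("C", "D"): 0, ("D", "C"): 3, ("D", "D"): 1}

# Hypothetical mixed strategies, chosen only for illustration.
sigma1 = {"C": Fraction(1, 2), "D": Fraction(1, 2)}
sigma2 = {"C": Fraction(1, 4), "D": Fraction(3, 4)}


def expected_value(prospect):
    """Expected utility value of a prospect [(outcome, probability), ...]."""
    return sum(outcome * prob for outcome, prob in prospect)


# First step: one prospect for each pure strategy of player 1.
first_prospects = {
    s: [(g1[(s, y)], sigma2[y]) for y in sigma2] for s in sigma1
}
values = {s: expected_value(first_prospects[s]) for s in sigma1}

# Second step: a prospect over the values, weighted by sigma1.
second_prospect = [(values[s], sigma1[s]) for s in sigma1]
two_step = expected_value(second_prospect)

# Direct computation of the mixed extension of g1.
direct = sum(g1[(x, y)] * sigma1[x] * sigma2[y] for x in sigma1 for y in sigma2)

print(values, two_step, direct)
```

Replacing `expected_value` with a prospect theoretical valuation in both steps is exactly the modification discussed in the text; in that case the two-step value no longer has to agree with the double sum.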

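We want to replace the classical values $g_i(\sigma_i,\sigma_{-i})$ with new values $V_i(\sigma_i,\sigma_{-i})$, 
obtained by replacing expected utility theory with cumulative prospect theory. From the 
example, it is clear that, to compute $V_i(\sigma_i,\sigma_{-i})$ in cumulative prospect theory, we only 
need to compute first $V_i(s_i,\sigma_{-i})$, for all $s_i \in \mathrm{Pl}_i$, using cumulative prospect theory on 
the prospects $p^{(s_i,\sigma_{-i})}$, and then compute $V_i(\sigma_i,\sigma_{-i})$ using cumulative prospect theory 
on the prospect $p^{(\sigma_i,\sigma_{-i})}$. To make this idea formal, recall that in cumulative prospect 
theory the outcomes of a prospect are supposed to be ordered in an increasing way. It 
is then useful to associate to each prospect $p = (x_1,p_1; \ldots; x_n,p_n)$, with distinct 
outcomes $x_j \in \mathbb{R}$, a permutation $\rho(p)$ that is just the permutation of the $x_j$'s such that 
$\rho(p)(x_i) < \rho(p)(x_{i+1})$, for all $i$. Now for all $(s_i,s_{-i}) \in \mathrm{Pl}_i \times \mathrm{Pl}_{-i}$, we define 

$$A_i^{(s_i,s_{-i})} := \{ s'_{-i} \in \mathrm{Pl}_{-i} : g_i(s_i, s_{-i}) = g_i(s_i, s'_{-i}) \}.$$

Footnote (on the outcomes of a prospect): If this prospect does not contain the zero-payoff, we add it with probability zero. 

For any fixed $s_i$, the sets $A_i^{(s_i,s_{-i})}$ form a partition of $\mathrm{Pl}_{-i}$. Choose a transversal $T_{s_i}$ for this 
partition, that is, $T_{s_i}$ is a subset of $\mathrm{Pl}_{-i}$ constructed by picking exactly one point for each 
set $A_i^{(s_i,s_{-i})}$. Now fix $(\sigma_i,\sigma_{-i}) \in \mathcal{P}(\mathrm{Pl}_i) \times \mathcal{P}(\mathrm{Pl}_{-i})$ and define the prospect 

$$p^{(s_i,\sigma_{-i})} = \left( g_i(s_i,s_{-i}),\ \sigma_{-i}\big(A_i^{(s_i,s_{-i})}\big) \right),$$

where $s_{-i}$ runs over the transversal $T_{s_i}$. Of course, this prospect does not depend on 
the particular transversal we fixed. Now, the outcomes of this prospect might not be 
ordered in an increasing way. Therefore, before applying cumulative prospect theory to 
compute $V_i(s_i,\sigma_{-i})$ we must apply the permutation $\rho(p^{(s_i,\sigma_{-i})})$. Consequently, 
with the notation as in Section 7, we obtain $V_i(s_i,\sigma_{-i})$ by applying the cumulative prospect theory valuation to the reordered prospect. 

To construct the second prospect we follow an analogous procedure. Let 

$$B_i^{(s_i,\sigma_{-i})} := \{ s'_i \in \mathrm{Pl}_i : V_i(s_i,\sigma_{-i}) = V_i(s'_i,\sigma_{-i}) \}.$$

The $B_i$'s form a partition of $\mathrm{Pl}_i$. Let $T_{\sigma_{-i}}$ be a transversal for this partition. We define 
the prospect 

$$p^{(\sigma_i,\sigma_{-i})} = \left( V_i(s_i,\sigma_{-i}),\ \sigma_i\big(B_i^{(s_i,\sigma_{-i})}\big) \right),$$

where $s_i$ runs over $T_{\sigma_{-i}}$. Therefore we obtain $V_i(\sigma_i,\sigma_{-i})$ by applying the same valuation to this prospect, after reordering its outcomes through the permutation $\rho(p^{(\sigma_i,\sigma_{-i})})$. 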
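The construction of a prospect from a partition and a transversal amounts to merging the opponents' profiles that give the same gain, summing their probabilities, and sorting the outcomes. The following is a minimal sketch of this step only; the function name `build_prospect` and the toy data are hypothetical, and the cumulative prospect theory valuation itself is not implemented.

```python
from fractions import Fraction


def build_prospect(gain, sigma_minus_i, add_zero=True):
    """Build the ordered prospect of player i for a fixed own strategy.

    gain: dict mapping each opponents' profile to the gain of player i.
    sigma_minus_i: dict mapping each opponents' profile to its probability.
    Profiles with equal gain are merged (one class of the partition each),
    and outcomes are returned in increasing order.
    """
    merged = {}
    for profile, prob in sigma_minus_i.items():
        outcome = gain[profile]
        merged[outcome] = merged.get(outcome, 0) + prob
    # Following the footnote: add the zero payoff with probability zero.
    if add_zero and 0 not in merged:
        merged[0] = 0
    return sorted(merged.items())


# Toy data: three opponents' profiles, two of which give the same gain.
gain_a = {"x": 3, "y": 0, "z": 3}
sigma_a = {"x": Fraction(1, 2), "y": Fraction(1, 4), "z": Fraction(1, 4)}
prospect_a = build_prospect(gain_a, sigma_a)

# Toy data without a zero outcome.
gain_b = {"x": 5, "y": 2}
sigma_b = {"x": Fraction(2, 3), "y": Fraction(1, 3)}
prospect_b = build_prospect(gain_b, sigma_b)

print(prospect_a)
print(prospect_b)
```

Since the merged probabilities depend only on the classes of the partition, the result does not depend on which representative of each class is picked, which mirrors the independence from the transversal noted in the text.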




One is now tempted to define a Nash equilibrium of a game under cumulative prospect 
theory as a profile $(\sigma_1, \ldots, \sigma_N)$ of mixed strategies such that for all $i \in P$ and for all 
$\sigma'_i \in \mathcal{P}(S_i)$ one has $V_i(\sigma_i,\sigma_{-i}) \geq V_i(\sigma'_i,\sigma_{-i})$. As mentioned before, unfortunately, 
there are games without Nash equilibria in this sense. To avoid this problem, we use 
Crawford's trick to extend the set of Nash equilibria by including the so-called equilibria 
in beliefs. To do that, first we recall the following classical definition. 

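Definition 9.2. Let $\mathcal{D} \subseteq \mathbb{R}^n$ be a convex set and let $\phi : \mathcal{D} \to \mathbb{R}$ be a function. The 
upper contour set of $\phi$ at $a \in \mathbb{R}$ is the set 

$$U_\phi(a) = \{ x \in \mathcal{D} : \phi(x) \geq a \}.$$

$\phi$ is called quasiconcave on $\mathcal{D}$ if $U_\phi(a)$ is a convex set for all $a \in \mathbb{R}$. 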
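Convexity of all upper contour sets is equivalent to the inequality $\phi(\lambda x + (1-\lambda) y) \geq \min\{\phi(x), \phi(y)\}$ for all $x, y$ and $\lambda \in [0,1]$, which is easy to test numerically. The sketch below checks this inequality on a finite grid in dimension one; it can only refute quasiconcavity, not prove it, and the function name, the grid and the tolerance are illustrative assumptions.

```python
def is_quasiconcave_on_grid(f, points, lambdas, tol=1e-12):
    """Check f(l*x + (1-l)*y) >= min(f(x), f(y)) on a finite grid."""
    for x in points:
        for y in points:
            lower = min(f(x), f(y))
            for lam in lambdas:
                z = lam * x + (1 - lam) * y
                if f(z) < lower - tol:
                    return False
    return True


points = [i / 10 - 1 for i in range(21)]   # grid on [-1, 1]
lambdas = [k / 10 for k in range(11)]      # grid on [0, 1]

# A monotone function is quasiconcave even if it is not concave.
cube_ok = is_quasiconcave_on_grid(lambda x: x ** 3, points, lambdas)

# x^2 is not quasiconcave on [-1, 1]: its upper contour sets are not convex.
square_ok = is_quasiconcave_on_grid(lambda x: x ** 2, points, lambdas)

print(cube_ok, square_ok)
```

The second function fails the test already at $x = -1$, $y = 1$, $\lambda = 1/2$, where the midpoint value 0 is below the common endpoint value 1.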

The following definition appeared in [Cr90], Definition 3. In this definition the word 
game is used to denote a classical finite game in normal form $\mathcal{G} = \mathcal{G}(P, S, u)$, where the 
utility functions are extended to the mixed strategies in a possibly non-linear manner. 

Definition 9.3. The convexified version of a game is obtained from the game by re- 
placing each player's preferences by the quasiconcave preferences whose upper contour 
sets are the convex hulls of his original upper contour sets, leaving other aspects of the 
game unchanged. 

We now define Crawford's equilibria in beliefs through an equivalent condition proved 
by Crawford himself in [Cr90], Theorem 1. 

Definition 9.4. An equilibrium in beliefs is any Nash equilibrium of the convexified 
version of the game. 

Crawford proved in [Cr90], Observation 1, that a Nash equilibrium is always an 
equilibrium in beliefs and, in Observation 2, that the set of equilibria in beliefs coincides 
with the set of Nash equilibria if the players have quasiconcave preferences. 

We can now define the cooperative equilibria of a game in explicit form. 

Definition 9.5. The cooperative equilibria of a game in explicit form $\mathcal{G} = \mathcal{G}(P, S, Q, g, a, f)$ 
are obtained by applying to the reduced game $\mathrm{Red}(\mathcal{G})$ the procedure described in Section 4, 
replacing 

• the function $g_i(\sigma)$ with the function $V_i(\sigma)$, 

• the notion of Nash equilibrium with the notion of equilibrium in beliefs, 

• the value function $v_i(p)$ in (1) with the one in (15). 

Theorem 9.6. Cooperative equilibria exist for all finite games in explicit form. 

Proof. Let $\mathcal{G} = \mathcal{G}(P, S, Q, g, a, f)$ be a finite game in explicit form. We have already 
proved in Section 8 that the iterated deletion of strategies leads to a well defined and 
non-empty subgame $\mathrm{Red}(\mathcal{G})$. We shall prove that the construction in Section 4 can be 
applied to $\mathrm{Red}(\mathcal{G})$. 

Fix a coalition structure p and let Qp be the game obtained by Red(^) grouping 
together the players in the same coalition, as in Equation By Crawford's theorem 
(see [Cr90], Theorem 2), the set of equilibria in beliefs of $\mathcal{G}_p$ is not empty. Indeed, this 
is just the set of Nash equilibria of the convexified game. Now, since the preferences in 
cumulative prospect theory are described by a continuous function and since continuity 
is preserved by passing to the convexified version (see [Ro70], Theorem 17.2), it follows 
that the set of equilibria in beliefs of $\mathcal{G}_p$ is compact. Consequently, the sets $M(p_\alpha, p)$ 
in Equation (3) are non-empty and the definition of the induced game $\mathrm{Ind}(\mathcal{G},p)$ goes 
through. Observe that the induced game is not empty, since the value of a prospect is at 
most as large as the maximal outcome of the prospect, which is an infimum of values attained by 
the composed function $v \circ V_i$. Therefore, the set of $\sigma$'s such that $(v \circ V_i)(\sigma) \geq v_i^{CPT}(p)$ 
is non-empty. Consequently, the set of mixed strategies of the induced game is a non-empty, 
convex and compact subset of the set of mixed strategies of the original game $\mathcal{G}$. 
Since in the convexified version of a game the set of mixed strategies does not change, the 
convexified version of $\mathrm{Ind}(\mathcal{G},p)$ has a non-empty set of Nash equilibria (indeed, observe 
that Nash's proof of existence of equilibria goes through also if only distinguished convex 
and compact subsets of mixed strategies are allowed). Applying Theorem 1 in [Cr90], 
it follows that the induced game $\mathrm{Ind}(\mathcal{G},p)$ has a non-empty set of equilibria in beliefs. 
Hence, Definition 4.14 defines a non-empty notion of equilibrium. 

Consequently, Definition 9.5 defines a non-empty notion of equilibrium. □ 

The following corollary follows straight from the construction. 

Corollary 9.7. The exact cooperative equilibrium of a game $\mathcal{G}$ does not depend on the 
fairness functions and on the altruism parameters, if 

(1) $\mathcal{G}$ does not have any super-dominated strategies, 

(2) for every coalition structure $p$, the game $\mathcal{G}_p$ has a unique equilibrium in beliefs. 

Remark 9.8. Also in this case we may define the quantal cooperative equilibrium 
under cumulative prospect theory and taking into account altruism: agent $i$ plays 
with probability $e^{v_i^{CPT}(p)} / \sum_{p'} e^{v_i^{CPT}(p')}$ a quantal level-k solution of the induced game 
$\mathrm{Ind}(\mathrm{Red}(\mathcal{G}),p)$. Such a quantal cooperative equilibrium explains deviations from Nash 
equilibrium that have been observed also in purely competitive games, such as the asymmetric 
matching pennies game tested in [Go-Ho01], that is, the game with gains: 

      L         R 
U    320,40    40,80 
D    40,80     80,40 

It was reported in [Go-Ho01] that most vertical players played the strategy U and 
most horizontal players played the strategy R. Observe that the Nash equilibrium 
for the vertical player is the uniform measure on {U, D}, since the gains of the horizontal 
player are the same as in the matching pennies. We believe that this behavior ultimately 
rests on a mistake of the vertical players, due to the illusion of a large gain, and that this 
mistake is predicted by the horizontal player. This interpretation is confirmed by the 
cooperative equilibrium. Indeed, the value of the cooperative coalition is easily seen to 
be equal to 40 for both players and, therefore, the exact cooperative equilibrium reduces 
to the Nash equilibrium and the quantal cooperative equilibrium reduces to the quantal 
level-k solution. The latter performs well in such a situation: if the vertical player 
makes the mistake of thinking that the horizontal player is level-0, and hence that she or he 
is indifferent between playing L and R, then the vertical player would have a strong 
incentive to play the strategy U. At this point, the assumption that the horizontal 
player is level-2 implies that she or he best responds (up to a small mistake) to the 
strong deviation towards U, which is a strong deviation towards R. 
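The numbers behind this argument can be verified directly. The sketch below, a plain expected utility computation with hypothetical helper names, derives the mixed Nash equilibrium of the asymmetric matching pennies game from the indifference conditions and then computes the best reply of the vertical player to a uniformly randomizing horizontal player, followed by the best reply of the horizontal player to U.

```python
from fractions import Fraction

# gains[(row, col)] = (gain of vertical player, gain of horizontal player)
gains = {
    ("U", "L"): (320, 40), ("U", "R"): (40, 80),
    ("D", "L"): (40, 80), ("D", "R"): (80, 40),
}


def indifference_weight(a, b, c, d):
    """Weight w solving a*w + b*(1-w) == c*w + d*(1-w)."""
    return Fraction(d - b, a - b - c + d)


# Weight on U that makes the horizontal player indifferent between L and R.
p_U = indifference_weight(
    gains[("U", "L")][1], gains[("D", "L")][1],
    gains[("U", "R")][1], gains[("D", "R")][1],
)

# Weight on L that makes the vertical player indifferent between U and D.
q_L = indifference_weight(
    gains[("U", "L")][0], gains[("U", "R")][0],
    gains[("D", "L")][0], gains[("D", "R")][0],
)

# Best reply of the vertical player to a uniform (level-0) horizontal player.
half = Fraction(1, 2)
vertical_payoff = {
    r: half * gains[(r, "L")][0] + half * gains[(r, "R")][0] for r in ("U", "D")
}
vertical_reply = max(vertical_payoff, key=vertical_payoff.get)

# Best reply of the horizontal player to the pure strategy U.
horizontal_payoff = {c: gains[("U", c)][1] for c in ("L", "R")}
horizontal_reply = max(horizontal_payoff, key=horizontal_payoff.get)

print(p_U, q_L, vertical_reply, horizontal_reply)
```

The vertical player mixes uniformly in the Nash equilibrium, while against a uniform opponent her expected gain from U is three times that from D, which is the large incentive mentioned in the remark.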

10. Summary, conclusions and open problems 

Over the last decades it has been realised that all classical solution concepts for 
one-shot normal form games fail to predict human behavior in several strategic situations. 

The purpose of this paper was to attribute these failures to two basic problems: the use 
of utility functions and the use of solution concepts that do not take into account the human 
attitude to cooperation. While the former problem could be theoretically overcome by 
replacing utility functions with gain functions and applying cumulative prospect theory, 
the second problem needs a different analysis of the structure of a game. We founded 
this new analysis on a seemingly reasonable principle of cooperation. 

(C) Players try to forecast how the game would be played if they formed coalitions 
and then they play according to their best forecast. 

Making this idea formal has required some effort. In Section 2 we have observed 
that passing from utility functions to gain functions implies that we must take into 
account new phenomena, such as altruism and perception of gains. We have formalized 
these phenomena by defining the so-called games in explicit form. After an example 
describing informally the main idea, in Section 4 we have formalized the principle of 
cooperation and we have defined the cooperative equilibrium for games in explicit form 
without using altruism parameters and cumulative prospect theory. The reason for this 
choice is that altruism and cumulative prospect theory play an active role only on a 
limited class of games. Indeed, in Section 5 we have shown that the cooperative 
equilibrium without altruism and cumulative prospect theory already performs well in a 
number of relevant games. In Section 6 we have discussed a few examples where 
cumulative prospect theory starts playing an active role and, after a short introduction to 
cumulative prospect theory in Section 7, we have started to adapt the definition of 
cooperative equilibrium given in Definition 4.14 so that it can be applied to every game in 
explicit form using cumulative prospect theory. In Section 8 we have used altruism 
parameters to delete strategies that are not good for the collectivity. This iterated 
deletion of strategies leads to the definition of a certain subgame. The study of this subgame (done 
in Section 4 under expected utility theory and in Section 9 under cumulative prospect 
theory) contains all relevant new ideas of the paper, that is, the use of the principle 
of cooperation and the use of cumulative prospect theory: we have assumed that all 
players try to forecast how the game would be played if they formed coalitions; we have 
used cumulative prospect theory to define a notion of value of a coalition and then, 
appealing to some Bernoulli-type principle, we have postulated that agents play according 
to the coalition with highest value. 
As shown in the examples in Section 5, the theory has many positive consequences: to 
the best of our knowledge, it is the first theory able to organize the experimental data 
collected for the Traveler's Dilemma, Prisoner's Dilemma, Nash bargaining problem, 
Bertrand competition, public good game, ultimatum game, and dictator game. These 
successful applications, and the lack of examples where the cooperative equilibrium fails 
(qualitatively) to predict human behavior, make us optimistic about this direction of 
research. Nevertheless, we are perfectly aware that the theory is questionable on several 
points which deserve more attention in future research. These points include: 

(1) To understand if there are other parameters to be taken into account in the 
definition of games in explicit form. In particular, there is some evidence that 
badness parameters can play an important role in some situations, one of which 
is described in the following point. 

(2) To understand what happens if the players do not agree on playing according 
to the same coalition structure. Indeed, the cooperative equilibrium works very 
well in all examples we have discussed since there is a unique coalition structure 
$p$ that maximizes the value of all players. What happens if different players 
have different coalition structures maximizing their own value? Do all players 
defect and play according to the coalition structure generated by the maximizing 
coalition structures, that is, the coarsest coalition structure which is finer than all 
maximizing coalition structures? Or do the players agree to play the fairest 
coalition structure? In this latter case, what happens if there are many fairest 
coalition structures? Do the players play uniformly among them? 

The difficulty in understanding this point is due mainly to the lack of relevant 
examples where this situation happens. In fact, we are aware of only one 
example where this situation is about to happen. We construct this game taking 
inspiration from a similar game recently tested in [We-Ra]. Two players 
have the same strategy set $S_1 = S_2 = S = \{0, 1, \ldots, 30\}$. The gain functions 
are as follows: 



Let us compute the cooperative equilibrium of this game. The unique 
equilibrium of the cooperative coalition structure $p_c = (\{1,2\})$ is (0,0), where both 
players get 30. Observe that no player has an incentive to deviate from this 
equilibrium and consequently, the values of $p_c$ are 

$$v_1(p_c) = 30 \quad \text{and} \quad v_2(p_c) = 30.$$

Now consider the selfish coalition structure $p_s = (\{1\},\{2\})$. The value for 
the second player is again $v_2(p_s) = 30$, whereas this time one gets $v_1(p_s) = 15$. 
Indeed, this is one of the cases where the natural symmetry of the game implies 
that we can restrict the set $M(\{2\},p_s)$ by taking its barycenter. In other words, 
when player 2 plays according to $p_s$, she is indifferent among her choices and so 
she plays uniformly. Player 1's best reply to player 2's uniform measure is the 
uniform measure, which gives payoff 15. Since this is a Nash equilibrium, there 
are no possible deviations and so $v_1(p_s) = 15$. 

So in this case, the unique cooperative equilibrium is (0,0). In other words, 
player 2 favors player 1 by playing 0, and player 1 knows that player 2 is going to 
favor her and so she plays 0 as well. This seems a very natural solution but: do 
humans really play (0,0)? 




We tried to simulate this game with colleagues and friends and something 
interesting apparently came out. One friend, asked to play the game in the role 
of player 2, said: "It depends. If player 1 is very rich, I would play 30 for sure!". 
The most common question we were asked after explaining the game was: "Do 
I know the other player?". After asking them to imagine an anonymous situation, the 
most common answer (nine out of ten) was: "Why should I hurt a person that 
I do not know? I would play 0.". One person said: "I don't care! I would pick 
a number randomly". 

Of course, these cannot be considered as experimental data, but we believe 
that they nonetheless represent some slight evidence that badness parameters do exist. 
It is not yet clear to the author how to manage them from a general point of 
view and we will postpone the theorization to a new paper, hopefully helped by 
more experimental data. However, we can say right now how these parameters 
would affect the play of this particular game. We guess that the badness parameters 
$b_{ij}$ are non-negative real numbers, where $b_{ij} = 0$ represents the situation 
where player $i$ is absolutely good against player $j$, that is, player $i$ favors player $j$ 
whenever possible, and $b_{ij} = \infty$ represents the situation where player $i$ is 
absolutely bad against player $j$. As said, it is not yet clear which would be the exact 
mathematical definition and the exact effect of these parameters on a general 
game, but the idea is that in this particular version of the Ultimatum game, 
the second player plays according to the parameter $b_{21}$ and player 1 estimates 
a priori the parameter $b_{21}$ and plays a best reply to the strategy that player 2 
would play if her badness parameter were equal to player 1's estimation. 

(3) The formula used in Equation (1) to compute the value of a coalition seems
quite reasonable and it fits the experimental data quite well, but it
is certainly only a first attempt. Further thought, possibly supported by more
experimental data, may help to understand the value of a coalition. The main
point is probably:

• to understand whether the value should be computed taking into account
also deviations towards safe strategies.
Indeed, consider the two-player game with gain matrix

           a          b
    a    1, 1      0, -k
    b   -k, 0     10, 10

The cooperative equilibrium is (b, b) independently of k. Is this reasonable,
or, for k large enough, would players prefer not to take the risk and play the
safe strategy (a, a)?
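
The tension in this example can be made concrete by computing, for each strategy, the worst payoff a player can receive. The following is a minimal sketch only: the function names are hypothetical, and "safe" is read here as maximin (the strategy with the best guaranteed payoff), which is an assumption about the intended meaning and not a definition from this paper.

```python
def row_payoffs(k):
    """Row player's payoffs in the 2x2 game above, indexed [own][other].

    The game is symmetric, so the same table applies to the column player.
    """
    return {
        "a": {"a": 1, "b": 0},
        "b": {"a": -k, "b": 10},
    }


def worst_case(k):
    """Guaranteed (worst-case) payoff of each strategy."""
    payoffs = row_payoffs(k)
    return {own: min(row.values()) for own, row in payoffs.items()}


def maximin_strategy(k):
    """Strategy with the highest guaranteed payoff (assumed notion of 'safe')."""
    guaranteed = worst_case(k)
    return max(guaranteed, key=guaranteed.get)


if __name__ == "__main__":
    for k in (1, 10, 100):
        print(k, worst_case(k), maximin_strategy(k))
```

For every k > 0 the guaranteed payoff of a is 0 while that of b is -k, so a is the maximin strategy and the loss risked by playing b grows linearly in k, whereas the payoff of (b, b) stays fixed at 10.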

(4) The formula used in Equation (1) to compute the value of a coalition is
questionable on another point. In the definition of the numbers $R_j(p)$, we have
considered only the first step of the reasoning: if player j decides to abandon the
coalition structure p, then another player, say k, may do the same, either to follow
selfish interests or because she or he is clever enough to anticipate player j's
deviation. But if player j is also clever enough to anticipate player k's deviation,
then player j may deviate from the deviation, and so forth. We could continue
this reasoning and define the risk $R_j(p)$ to be, roughly speaking, the maximal
loss that player j incurs when a profile of strategies that can be reached by a
sequence of deviations is played. Of course, this definition would come at the price
of a major technical difficulty, but it would be theoretically more appealing, since
it would allow one to construct a bridge from the cooperative equilibrium theory to
another well-studied behavioral model. We recall that $\tau_{i,J}(p)$ has been called
a prior probability since, although it appears to be a very precise evaluation of
how player i measures the event "players in J abandon the coalition structure
p", it is quite possible that a specific player i, for personal reasons, evaluates this
event in a completely different way. In particular, the number $\tau_{i,\emptyset}$ represents
the probability that player i assigns to the event that no players abandon the
coalition. The types of players that are usually called, in the economic literature,
altruistic (resp. selfish) would then correspond to those players i who compute
the value of a coalition setting $\tau_{i,\emptyset} = 1$ (resp. $\tau_{i,\emptyset} = 0$), independently of the
prior value of such a probability. The correspondence between selfish players
and players who set $\tau_{i,\emptyset} = 0$ fails using the formula in (1), since this formula
with $\tau_{i,\emptyset} = 0$ can still predict cooperation, even though at a lower rate, as, for
instance, in the Traveler's dilemma.

(5) The exact computation of the cooperative equilibrium is hard for several
reasons. First, because it goes through the computation of the equilibria in
beliefs of several (sub)games. These equilibria are computationally hard to find
[Da-Go-Pa06]. Second, because it uses cumulative prospect theory, which is
computationally harder than expected utility theory. On the one hand, the method that
we have proposed is perfectly algorithmic, and therefore it might be helpful to
write a computer program to compute the cooperative equilibria and make it easier
to test them on simple real-life situations. On the other hand, it would
be important to investigate some computationally easier variant. Of course,
quantal level-k theory can be seen as a computationally easier variant, but this
theory has the serious issue that it would not be predictive, in the sense that
one has to conduct experiments to estimate the error parameter. One could try
to avoid this problem using level-k theory (i.e., only bounded rationality).
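
To give a rough sense of how quickly this computation grows, the following sketch counts the coalitions and the coalition structures available to n players. It assumes that a coalition is any non-empty subset of the player set and that a coalition structure is a partition of the player set, so that the counts are 2^n - 1 and the Bell number B_n respectively. The function names are illustrative only and are not part of the method proposed in this paper.

```python
from math import comb


def bell_numbers(n_max):
    """Return [B_0, ..., B_{n_max}] using B_{n+1} = sum_k C(n, k) * B_k."""
    bell = [1]  # B_0 = 1: the empty set has exactly one partition
    for n in range(n_max):
        bell.append(sum(comb(n, k) * bell[k] for k in range(n + 1)))
    return bell


def coalition_count(n):
    """Number of non-empty subsets of a set of n players."""
    return 2 ** n - 1


if __name__ == "__main__":
    bells = bell_numbers(10)
    for n in range(2, 11):
        print(n, coalition_count(n), bells[n])
```

Under these assumptions, a program that examines one (sub)game per coalition structure faces a number of cases that grows faster than exponentially in the number of players, which is one more reason to look for computationally easier variants.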

(6) Iterated deletion of strategies using altruism functions in Section 8 was certainly
quite sketchy, and it is likely that future research will suggest a different
procedure. In particular, the definition of unplayable strategies of the second type
for player i requires that only one particular player j receives a large loss. It is
possible that this condition is not sufficient to convince player i to give up
her better strategy when the players in P \ {i, j} receive a large gain.

(7) We have defined altruism functions operationally, meaning that one could theo- 
retically compute them by conducting an experiment on the generalized dictator 
game. It would be important to find an operational way to define the fairness 
functions. 

Open problems include: 

(1) Many experiments with different purposes should be conducted. Indeed, an
interesting fact is that cooperative equilibrium sometimes makes completely new
predictions. A stream of experiments should be devoted to verifying or falsifying these
predictions. For instance,

• Apparently, the cooperative equilibrium is the only solution concept
predicting an increasing rate of cooperation in the Public Goods game as the
marginal return approaches 1. It seems that this prediction is partially
confirmed by experimental data but, as far as we know, only one
experiment has been devoted to reporting this behavior, that is, [IWW94].
Analogously, it seems that the cooperative equilibrium (under cumulative
prospect theory) is the only solution concept predicting or, at least,
justifying cooperation in the Public Goods game with a large number
of players. Also in this case, we are aware of only one experimental study
devoted to observing this unexpected behavior, that is, again, [IWW94].

Footnote: As observed by J. Halpern in a private communication, it is implausible that an agent would consider
all coalitions. In even moderately large games, there are just too many of them. She may consider some
natural coalitions (e.g., the coalition of all agents), but only a relatively small number. Of course, a
theory characterizing which coalitions would be considered is not easy to come by.

• Apparently, the cooperative equilibrium is the only solution concept
predicting a rate of cooperation in the Prisoner's dilemma that depends on
the particular gains. It seems that this prediction is partially confirmed
by experimental data, but only on the repeated Prisoner's dilemma (see
[DRFN08] and [Fu-Ra-Dr12]). Experiments with a one-shot parametrized
Prisoner's dilemma should be conducted to verify or falsify this prediction.

Another stream of experiments should be devoted to answering some theoretical
questions. At this first stage of research, we believe that the most important
one is:

• to understand whether the value of a coalition structure should be computed 
taking into account also deviations towards safe strategies. 

(2) To gain a better understanding of the relation between Nash equilibria and
cooperative equilibria (under expected utility theory) for two-person zero-sum games,
when the players have the same perception of gains. Indeed, Nash equilibrium
performs quite well for zero-sum games and it is possible that all deviations from
Nash equilibrium can be explained by cumulative prospect theory alone.
Therefore, it would be important to understand whether the cooperative
equilibrium (under expected utility theory and assuming $f_1 = f_2$) refines Nash
equilibrium, in the sense that the set of exact cooperative equilibria is always a subset
of the set of Nash equilibria. In this context, it would also be interesting to
start from relevant classes of zero-sum games, such as the group games, introduced
and studied in [Mo10], [Ca-Mo12], [Ca-Sc12]. Of course, a counter-example
would also be very important to understand if and where the theory can be modified.

(3) As stressed several times, the probability $\tau_{i,J}(p)$ is just a prior probability, in the
sense that it is quite possible that a particular player i computes this probability in
a completely different way. It would be important to understand the factors that
may influence the evaluation of this probability. For instance, it is well known
that the individual-level rate of cooperation depends on family history, age, culture,
gender, even university course [Ma-Am81], religious beliefs [HRZ11], and decision
time [RGN12]. The dream is to incorporate these factors into parameters
used to compute the probability $\tau_{i,J}$ at the individual level. Particularly interesting
would also be the study of this probability when players can talk to each other or
have any sort of contact (e.g., eye contact). Indeed, these contacts can create
phenomena of mind reading (see [Wi-MN-G Jj ) that we believe can be explained
in terms of the evaluation of the probability $\tau$.